Twitter's Rec Sys, HuggingGPT, Reflexion & More
Twitter's rec system (with W&B integration) uploaded to GitHub, HuggingGPT, Reflexion, and more. There's a lot going on in ML.
Twitter's Recommender System
This week Twitter open-sourced a huge part of their recommender system for the Home page.
Before we dive into the full story, it's worth noting that we were obviously happy, though not surprised, to see our Weights & Biases integration included in the code they released.
Our pride aside, let's dive into what's going on.
First, for more information on how (and why) the system works, check out the resources listed in the References section.
Generally, the process is broken down into retrieving, reranking, and filtering.
Retrieving
Also called candidate sourcing in their blog post, this component pulls data from your social graph, tweet engagement, and user data. They divide this stage into In-Network and Out-of-Network sources.
For in-network tweets, they use their RealGraph model, which predicts the likelihood of engagement between two users.
For out-of-network tweets, they analyze your social graph with their GraphJet graph-processing engine, asking questions like: "What Tweets did the people I follow recently engage with?"
They also leverage their SimClusters model, which groups users into communities; a tweet becomes more strongly associated with a community the more it is liked by users from that community.
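To make the candidate-sourcing idea concrete, here's a minimal sketch of how in-network and out-of-network candidates might be gathered and merged. The `real_graph`, `graph_jet`, and `sim_clusters` objects and their methods are hypothetical stand-ins for Twitter's components, not their actual APIs.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    tweet_id: int
    author_id: int
    source: str              # "in-network" or "out-of-network"
    engagement_prob: float   # RealGraph-style engagement estimate (0 for out-of-network here)

def source_candidates(user_id, followed_users, real_graph, graph_jet, sim_clusters, limit=1500):
    """Illustrative candidate sourcing; the component interfaces are hypothetical."""
    candidates = []

    # In-network: recent tweets from accounts you follow, weighted by the
    # predicted probability of engagement between the two users (RealGraph).
    for author_id in followed_users:
        prob = real_graph.engagement_probability(user_id, author_id)
        for tweet_id in graph_jet.recent_tweets(author_id):
            candidates.append(Candidate(tweet_id, author_id, "in-network", prob))

    # Out-of-network: tweets the people you follow recently engaged with (GraphJet)
    # and tweets popular in the communities you belong to (SimClusters).
    for tweet_id, author_id in graph_jet.tweets_engaged_by_follows(user_id):
        candidates.append(Candidate(tweet_id, author_id, "out-of-network", 0.0))
    for tweet_id, author_id in sim_clusters.top_tweets_for_user(user_id):
        candidates.append(Candidate(tweet_id, author_id, "out-of-network", 0.0))

    # Hand roughly the top `limit` candidates to the ranking stage.
    candidates.sort(key=lambda c: c.engagement_prob, reverse=True)
    return candidates[:limit]
```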
Reranking
A 48M-parameter neural network optimized for positive engagement (likes, retweets, etc.) reranks the roughly ~1,500 candidate tweets that come through from the retrieval stage, scoring each tweet on a handful of predicted engagement metrics.
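As a rough illustration of that scoring step, here's a hedged sketch: a ranker predicts per-action engagement probabilities for each candidate, and a weighted sum of those probabilities becomes the ranking score. The action names and weights are made up for illustration and are not Twitter's production values.

```python
# Illustrative reranking: weight predicted engagement probabilities into a
# single score, then sort. Actions and weights are hypothetical, not production values.
ENGAGEMENT_WEIGHTS = {"like": 1.0, "retweet": 2.0, "reply": 3.0, "report": -50.0}

def rerank(candidates, predict_engagement_probs):
    """`predict_engagement_probs(tweet)` stands in for the 48M-parameter ranker
    and returns a dict mapping an action name to its predicted probability."""
    scored = []
    for tweet in candidates:
        probs = predict_engagement_probs(tweet)
        score = sum(ENGAGEMENT_WEIGHTS.get(action, 0.0) * p for action, p in probs.items())
        scored.append((score, tweet))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tweet for _, tweet in scored]
```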
Filtering
They filter based on a few things:
- Visibility Filtering
- Author Diversity
- Content Balance
- Feedback-based Fatigue
- Social Proof
- Conversations
- Edited Tweets
This stage of the RecSys pipeline is a set of heuristics aimed at enforcing trust and safety, keeping content workplace-appropriate, and diversifying what ends up on your timeline. A rough sketch of how such filters might be chained is below.
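Here is a minimal sketch of what a filtering pass like this could look like. The fields, thresholds, and the subset of filters shown are assumptions for illustration, not Twitter's actual implementation.

```python
from dataclasses import dataclass

@dataclass
class RankedTweet:
    tweet_id: int
    author_id: int
    passes_visibility: bool   # trust & safety checks computed upstream
    has_social_proof: bool    # someone you follow engaged with it
    fatigue_score: float      # how often similar content was shown and ignored

def apply_filters(ranked_tweets, max_per_author=3, fatigue_threshold=0.8):
    """Illustrative filtering pass over already-ranked tweets; thresholds are hypothetical."""
    kept, per_author = [], {}
    for t in ranked_tweets:
        if not t.passes_visibility:                            # Visibility Filtering
            continue
        if per_author.get(t.author_id, 0) >= max_per_author:   # Author Diversity
            continue
        if t.fatigue_score > fatigue_threshold:                # Feedback-based Fatigue
            continue
        if not t.has_social_proof:                             # Social Proof
            continue
        per_author[t.author_id] = per_author.get(t.author_id, 0) + 1
        kept.append(t)
    return kept
```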
HuggingGPT
This paper unites HuggingFace models and ChatGPT.
ChatGPT serves as the brains and the other narrow AI models serve as workers delegated to specific tasks. The process is broken down into 4 steps:
- task planning,
- model selection,
- model execution, and
- response generation.
Given a prompt, ChatGPT must figure out how to decompose it into a set of tasks, assign a model to each task, have those task-specific models execute, and then synthesize their outputs into a response.
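Here's a hedged sketch of that four-stage loop in code. The `chat` callable stands in for a ChatGPT call, and the prompts and JSON task format are simplified placeholders rather than the paper's actual prompts; only `transformers.pipeline` is a real API.

```python
import json
from transformers import pipeline  # Hugging Face entry point for the "expert" models

def hugginggpt(user_request, chat):
    """Illustrative HuggingGPT-style loop; `chat(prompt)` is a stand-in for ChatGPT."""
    # 1. Task planning: ask the LLM to decompose the request into structured tasks.
    plan = json.loads(chat(f"Decompose this request into a JSON list of tasks: {user_request}"))

    results = {}
    for task in plan["tasks"]:
        # 2. Model selection: ask the LLM to pick a Hugging Face model for the task.
        model_id = chat(f"Pick a Hugging Face model id for this task: {task}").strip()

        # 3. Model execution: run the selected expert model on the task's input.
        expert = pipeline(task["type"], model=model_id)
        results[task["id"]] = expert(task["input"])

    # 4. Response generation: have the LLM weave the experts' outputs into an answer.
    return chat(f"Summarize these results for the user: {results}")
```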

Reflexion
Reflexion is an agent framework that allows an LLM to self-reflect.
What does this mean? In the method they propose, an LLM interacts with an environment (an RL-style setting) and uses a heuristic to decide whether to reflect on the action it just took; if it decides to, it generates a self-reflection and uses it to revise its decision-making on later attempts.
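Below is a minimal sketch of what such a reflect-and-retry loop could look like, assuming an `llm` callable for generating reflections and a `run_episode` callable for interacting with the environment; both names and the reward heuristic are hypothetical, not the paper's implementation.

```python
def reflexion_loop(task, llm, run_episode, max_trials=3):
    """Illustrative Reflexion-style loop: act, optionally self-reflect, retry."""
    reflections = []  # memory of self-reflections carried across trials
    trajectory = None
    for trial in range(max_trials):
        # Act: attempt the task, conditioning on any reflections from past failures.
        trajectory, reward = run_episode(task, context=reflections)

        if reward >= 1.0:        # success: stop early
            return trajectory

        # Heuristic: only spend time reflecting when the attempt clearly failed.
        if reward < 0.5:
            reflection = llm(
                "The last attempt failed. Explain what went wrong and how to "
                f"do better next time.\nTask: {task}\nTrajectory: {trajectory}"
            )
            reflections.append(reflection)
    return trajectory
```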

The authors state their proposal is effective at pinpointing hallucinations! Check out their paper and blog post for more details.
BloombergGPT
In an unexpected but interesting turn, Bloomberg created their own 50B-parameter LLM called BloombergGPT, trained on their custom FinPile dataset of domain-specific financial documents (news, press releases, etc.) mixed roughly half-and-half with general-purpose language data.

Their architecture is based on BLOOM. They followed the Chinchilla scaling laws and trained on a whopping 512 40GB A100 GPUs on AWS!
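As a back-of-the-envelope illustration of what Chinchilla-style scaling implies for a model of this size, the commonly cited rule of thumb is roughly 20 training tokens per parameter; the arithmetic below is just that heuristic, not BloombergGPT's actual token budget.

```python
# Chinchilla rule of thumb: ~20 training tokens per model parameter.
# This is an approximation of the scaling-law guidance, not Bloomberg's exact budget.
params = 50e9            # 50B parameters
tokens_per_param = 20
optimal_tokens = params * tokens_per_param

print(f"~{optimal_tokens / 1e12:.1f} trillion training tokens")  # -> ~1.0 trillion
```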

References