
New

Popular

Gym-μRTS: Toward Affordable Deep Reinforcement Learning Research in Real-Time Strategy Games
Costa Huang, Chris BamfordJun 14Advanced, Reinforcement Learning, Gaming, OpenAI, Experiment, Research, Github, Panels, Plots, Slider

Clear Search
English
Unboxing ChatGPT: A Deep-Dive on How This AI-Driven Chatbot Was Trained
Sundar Raman PFeb 20Large Models, Articles, Intermediate, Reinforcement Learning, GPT, OpenAI, LLM, NLP

Implementing RLHF: Learning to Summarize with trlX
Duy V. Phung, Ayush Thakur, Louis Castricato, Jonathan Tow, Alex HavrillaJan 12Articles, HuggingFace, Reinforcement Learning, NLP, Plots, Panels, Tutorial

Popular Topics
Iterate on AI agents and models faster. Try Weights & Biases today.