Skip to main content

New

Popular

Training Reproducible Robots with W&B
Armand du Parc Locmaria
Nov 17
Intermediate, Reinforcement Learning, W&B Meta, Artifacts, Plots
Gym-μRTS: Toward Affordable Deep Reinforcement Learning Research in Real-Time Strategy Games
Costa Huang, Chris Bamford
Jun 14
Advanced, Reinforcement Learning, Gaming, OpenAI, Experiment, Research, Github, Panels, Plots, Slider
Clear Search
English
Introducing Serverless RL
Kyle Corbitt
Oct 08
Articles, Reinforcement Learning
What is RLHF? Reinforcement learning from human feedback for AI alignment
Brett Young
Jul 28
Articles, Reinforcement Learning, GenAI, LLM, Evaluations, Tutorial
Getting started with the Agent Reinforcement Trainer (ART)
Brett Young
Jul 23
Articles, Reinforcement Learning, GenAI, Weave, Agents
Types of reinforcement learning algorithms
Brett Young
Jul 01
Articles, Reinforcement Learning, Agents, GenAI
Reinforcement learning for reasoning: Enhancing AI capabilities
Brett Young
Jun 25
Articles, Reinforcement Learning, Agents, Weave, GenAI
Exploring multi-agent reinforcement learning (MARL)
Brett Young
May 22
Articles, Reinforcement Learning, Agents, Evaluations, Panels
Getting started with reinforcement learning (with a Python tutorial)
Brett Young
May 02
Reinforcement Learning, Tutorial, Panels, Articles
Supervised learning vs deep learning vs reinforcement learning
Atharva Ingle
Apr 11
Articles, Reinforcement Learning, LLM, Beginner
Getting Started with Deep Q-Learning
Brett Young
Mar 18
Articles, Reinforcement Learning
Q-Learning: Implementation
Piyush Thakur
Jan 19
Articles, Reinforcement Learning, Intermediate, Tutorial
What is Q-Learning?
Piyush Thakur
Dec 01
Articles, Reinforcement Learning, Beginner
Fundamentals of Reinforcement Learning with Example Code
Mukilan Krishnakumar
Nov 04
Articles, Reinforcement Learning, Domain Agnostic, Beginner
Replit Hackathon Winner: Implementing Q-Learning from Scratch
Icemaster Eric
Mar 08
Articles, Reinforcement Learning, Community Posts, Gaming, Experiment
A Gentle Introduction to OpenAI Gym
Mukilan Krishnakumar
Feb 22
Articles, Beginner, Reinforcement Learning
Unboxing ChatGPT: A Deep-Dive on How This AI-Driven Chatbot Was Trained
Sundar Raman P
Feb 20
Large Models, Articles, Intermediate, Reinforcement Learning, GPT, OpenAI, LLM, NLP
An Introduction to Training LLMs Using Reinforcement Learning From Human Feedback (RLHF)
Ayush Thakur
Jan 30
Articles, Reinforcement Learning, Beginner, NLP, Large Models, LLM
Implementing RLHF: Learning to Summarize with trlX
Duy V. Phung, Ayush Thakur, Louis Castricato, Jonathan Tow, Alex Havrilla
Jan 12
Articles, HuggingFace, Reinforcement Learning, NLP, Plots, Panels, Tutorial
RLHF: Hyperparameter Optimization for trlX
Ayush Thakur
Nov 30
Articles, Advanced, Sweeps, Plots, Reinforcement Learning
Iterate on AI agents and models faster. Try Weights & Biases today.