Skip to main content
alpha-rl
Projects
TinyZero
Reports
Log in
Sign up
Overview
Workspace
Runs
Automat.
Sweeps
Reports
Artifacts
Anyone
Anyone
jerry_wu
Reports
Created by
Created On
Last edited
TinyZero-R1-Countdown Qilong Wu
Reproduce R1 ~ RL boosts the reasoning ability of LLM and enable the 'aha' moment testing in countdown task.
2
jerry_wu
2025-01-28
7 months ago
Clone report