Skip to main content
lkevinzc
Log in
Sign up
Zic
lkevinzc
NUS
Teams
openrlbenchmark
smu-rl-team
stlm
oat-llm
active-grpo
axon-rl
Profile
Activity
Mon
Wed
Fri
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
3 day streak
Runs
1-10
of 21
Name
Project
State
Created
Qwen3-30B-A3B-Base_rg:arc_1d
gem-tinker_train
Crashed
1 week ago
Llama-3.1-8B-Instruct_math:Math12K
gem-tinker_train
Crashed
1 week ago
Qwen3-8B-Base_game:Sudoku-v0-easy
gem-tinker_train
Crashed
1 week ago
Qwen3-8B-Base_math:Math8K-3to5
gem-tinker_train
Finished
1 week ago
Qwen3-30B-A3B-Base_math:DeepScaleR40K (Rank=1)
gem-tinker_train
Finished
1 week ago
Llama-3.1-8B-Instruct_math:Math12K
gem-tinker_train
Finished
1 week ago
Qwen3-8B-Base_math:DeepScaleR40K (Increasing resp len)
gem-tinker_train
Finished
1 week ago
Qwen3-8B-Base_rg:simple_equations
gem-tinker_train
Finished
2 weeks ago
zichen-qwen3-4b-base-math:DeepScaleR40K-OnPolicy_0929T14:20:22
gem
Crashed
2 weeks ago
zichen-qwen3-4b-base-math:DeepScaleR40K-8OffPolicy_0929T14:19:40
gem
Finished
2 weeks ago
Loading...