ucalyptus / gsm_hard_code_syntax
Runs (18)
/data/grpo-folder/weights/qwen-2.5-1.5b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-3b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-3b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-7b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-3b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/phi-4-14b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-1b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-1b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-1b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-7b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-1.5b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/phi-4-14b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.1-8b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-1.5b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-3b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.2-1b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/qwen-2.5-1.5b/train_gsm_hard.jsonl/
/data/grpo-folder/weights/llama-3.1-8b/train_gsm_hard.jsonl/
[Chart: train/reward (y-axis, ticks 0 to 1.5) against train/global_step (x-axis, ticks 50 to 250), showing the first 10 runs]