ucalyptus
grpo-april
Ucalyptus's workspace
Runs (146 total, 146 visualized)
qwen-2.5-3b/test_gsm_symbolic_formatted.jsonl/
llama-3.2-3b/test_gsm_symbolic_formatted.jsonl/
qwen-2.5-0.5b/test_gsm_symbolic_formatted.jsonl/
llama-3.2-1b/test_gsm_symbolic_formatted.jsonl/
qwen-2.5-1.5b/test_gsm_symbolic_formatted.jsonl/
phi-4-14b/test_gsm_symbolic_formatted.jsonl/
llama-3.2-1b/test_gsm_symbolic_formatted.jsonl/
qwen-2.5-0.5b/test_gsm_symbolic_formatted.jsonl/
llama-3.1-8b/test_gsm_symbolic_formatted.jsonl/
qwen-2.5-7b/test_gsm_symbolic_formatted.jsonl/
qwen-2.5-7b/file-5fb436eb-210e-47c1-95a7-736cbac3e5ac.jsonl/
qwen-2.5-7b/file-5fb436eb-210e-47c1-95a7-736cbac3e5ac.jsonl/
qwen-2.5-7b/file-5fb436eb-210e-47c1-95a7-736cbac3e5ac.jsonl/
qwen-2.5-7b/file-5fb436eb-210e-47c1-95a7-736cbac3e5ac.jsonl/
qwen-2.5-7b/file-5fb436eb-210e-47c1-95a7-736cbac3e5ac.jsonl/
qwen-2.5-1.5b/file-b8294709-8811-4daf-a428-3ebcdad2f245.jsonl/
llama-3.1-8b/file-a0a24a76-633e-4b4d-a81c-63507ba389e8.jsonl/
llama-3.1-8b/file-eb3b6be1-fa96-4be0-89c0-0c30dd44c4a0.jsonl/
llama-3.2-1b/file-8cf52c9d-6d0d-448a-979f-844a010f7659.jsonl/
qwen-2.5-3b/file-aa9b1322-9e13-4ae9-a62a-2ff1f1bcbb25.jsonl/
Showing runs 1-20 of 146
train/rewards/string_similarity_reward
Showing first 10 runs
[Line chart. x-axis: Step, ticks 200 to 1.2k. y-axis: ticks 0 to 0.6.]
qwen-2.5-1.5b/file-b8294709-8811-4daf-a428-3ebcdad2f245.jsonl/
llama-3.1-8b/file-a0a24a76-633e-4b4d-a81c-63507ba389e8.jsonl/
llama-3.1-8b/file-eb3b6be1-fa96-4be0-89c0-0c30dd44c4a0.jsonl/
qwen-2.5-3b/file-aa9b1322-9e13-4ae9-a62a-2ff1f1bcbb25.jsonl/
qwen-2.5-1.5b/file-65f84d83-53c4-4f39-8def-ba822be55d73.jsonl/
qwen-2.5-7b/file-187d859f-971f-4e01-bed3-87ce30c3e534.jsonl/
qwen-2.5-3b/file-8b5318f0-71ee-4999-a0c6-1884002a82bd.jsonl/
qwen-2.5-0.5b/file-fa984e54-f0e5-4bd5-8af7-8c494bb7a21a.jsonl/
phi-4-14b/file-1951b576-6e08-497f-a6f2-27f0cd6c6473.jsonl/
llama-3.1-8b/file-064736c1-9c5d-46b6-9fd9-367e850f2084.jsonl/