Qwen2.5-1.5B-Instruct with Python code interpreter
Created on February 18|Last edited on February 18
Comment
Section 1
train/rewards/format_reward
train/rewards/format_reward
train/rewards/code_reward
train/rewards/code_reward
train/completion_length
train/completion_length
lr scan
5
Add a comment
Created with ❤️ on Weights & Biases.
https://wandb.ai/huggingface/open-r1/reports/Qwen2-5-1-5B-Instruct-with-Python-code-interpreter--VmlldzoxMTQxNTA3Mg