Apiche's group workspace
debug_gspo13
What makes this group special?
Tags
debug_gspo13/finetune
Notes
Author
State
Crashed
Start time
September 28th, 2025 2:42:50 AM
Runtime
1h 40m 49s
Tracked hours
5m 1s
Run path
apiche/pipeline-rl/debug_gspo13_finetune
OS
Linux-5.15.0-1067-nvidia-x86_64-with-glibc2.39
Python version
CPython 3.11.11
Git repository
git clone git@github.com:ServiceNow/pipelinerl.git
Git state
git checkout -b "debug_gspo13/finetune" 977ccd227a5ec4bbb386300e93092b8194c2af47
Command
pipelinerl/entrypoints/run_finetune.py --config-dir results/debug_gspo13/conf --config-name exp_config output_dir=results/debug_gspo13 hydra.run.dir=results/debug_gspo13/finetune +me.weight_update_group_init_method=tcp://localhost:9000 +me.weight_update_group_world_size=5 +me.llm_urls=http://localhost:8080+http://localhost:8081+http://localhost:8082+http://localhost:8083
System Hardware
| CPU count | 112 |
| Logical CPU count | 224 |
| GPU count | 8 |
| GPU type | NVIDIA H100 80GB HBM3 |
W&B CLI Version
0.19.11
Group
debug_gspo13Config
Config parameters are your model's inputs. Learn more
- {} 185 keys▶
- "False"
- "no"
- null
- 1
- 64
- 0
- 64
- 64
- "pipelinerl.domains.math.generate_math_rollout"
- 1
- 10,000,000
- "Please reason step by step, and put your final answer within \boxed{}."
- ""
- 50
- 1
- "nccl"
- "pipelinerl.domains.math.load_datasets"
- "False"
- ""
- true
- null
- false
- "deepspeed_stage3_bf16"
- "DeepSpeedPlugin(hf_ds_config=<accelerate.utils.deepspeed.HfDeepSpeedConfig object at 0x7ffc246e1fd0>, gradient_accumulation_steps=1, gradient_clipping='auto', zero_stage=3, is_train_batch_min=True, offload_optimizer_device='none', offload_param_device='none', offload_optimizer_nvme_path='none', offload_param_nvme_path='none', zero3_init_flag=True, zero3_save_16bit_model=True, transformer_moe_cls_names=None, enable_msamp=False, msamp_opt_level='O1')"
- "cuda:0"
- "DistributedType.DEEPSPEED"
- "TorchDynamoPlugin(backend=<DynamoBackend.NO: 'NO'>, mode='default', fullgraph=False, dynamic=None, options=None, disable=False, use_regional_compilation=False)"
- "pipelinerl.domains.math.MathEnvironment"
- 0
- [] 0 items
- 1
- "flash_attention_2"
- false
- "Qwen/Qwen2.5-0.5B"
- true
- null
- "tapeagents.finetune.eval.dummy_eval_callback"
- ""
- true
- 16
- true
- 0.3
- "training_data"
- -1
- true
- 0.000001
- 7,777
- 1
- 0
- 1
46 ... 95▶▶96 ... 145▶▶146 ... 180▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 66 keys▶
- 3.743261814117432
- 0
- 0
- 4.092408180236816
- 0
- 0.6677849530773315
- 105.75
- 5.919403553009033
- 0
- 0.4847049117088318
- -182.7752227783203
- 3.953125
- 69.46488189697266
- -45.54624938964844
- 0
- 0.0625
- 2.890625
- 0.0005342960357666016
- -45.8702392578125
- 0
- 0.0625
- -4.686008930206299
- 40
- -4.1688618659973145
- 0.9675576686859132
- 55.71575927734375
- 38.577789306640625
- 7.436551094055176
- 1
- -4.1688618659973145
- 0
- 0.0625
- 17.377090454101562
- -2.890625
- -3.743261814117432
- -3.953125
- 14.041807174682615
- 139
- 0
- 501.21562853260406
- 288
- 0.000001
- 1,920
- 1,920
- 155
- 1
- 423
- 2,872.9807568211795
- 1,692
- 718.2451892052949
46 ... 61▶▶
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...