Apiche's group workspace
gpt_oss
Tags
gpt_oss/actor
State
Crashed
Start time
August 8th, 2025 8:29:53 PM
Runtime
4d 4h 55m 35s
Tracked hours
1d 16h 28m 33s
Run path
apiche/pipeline-rl/gpt_oss_actor
OS
Linux-5.15.0-1067-nvidia-x86_64-with-glibc2.39
Python version
CPython 3.11.11
Git repository
git clone git@github.com:ServiceNow/pipelinerl.git
Git state
git checkout -b "gpt_oss/actor" a634b394192af0d1b77b159c5e54a44140c47dfe
Command
-m pipelinerl.entrypoints.run_actor --config-dir results/gpt_oss/conf --config-name exp_config output_dir=results/gpt_oss hydra.run.dir=results/gpt_oss/actor +me.llm_urls=http://localhost:8080+http://localhost:8081+http://localhost:8082+http://localhost:8083
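The last argument of the command packs four endpoint URLs into a single value joined by `+`. The leading `+` on `+me.llm_urls` is Hydra's syntax for adding a key that is not already in the config. How PipelineRL itself splits the value is not shown on this page, so the sketch below is only an assumption about the format: it treats `+` as the separator, and `split_llm_urls` is a hypothetical helper, not part of the library.

```python
def split_llm_urls(value: str) -> list[str]:
    """Split a '+'-joined URL list into individual URLs.

    Hypothetical helper: assumes '+' is used purely as a separator
    and never appears inside a URL.
    """
    return [url for url in value.split("+") if url]


# The override exactly as it appears in the run command.
override = (
    "+me.llm_urls=http://localhost:8080+http://localhost:8081"
    "+http://localhost:8082+http://localhost:8083"
)

# Drop Hydra's leading '+' (add-new-key marker), then split key from value.
key, _, value = override.lstrip("+").partition("=")
urls = split_llm_urls(value)

for url in urls:
    print(key, url)
```

Under that assumption the override yields four local endpoints on ports 8080 through 8083.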
System Hardware
| CPU count | 112 |
| Logical CPU count | 224 |
| GPU count | 4 |
| GPU type | NVIDIA H100 80GB HBM3 |
W&B CLI Version
0.19.11
Group
gpt_oss
Config
Config parameters are your model's inputs.
- {} 172 keys
- null
- 1
- 64
- 0
- 64
- 64
- "pipelinerl.domains.math.generate_math_rollout"
- 1
- 10,000,000
- "Please reason step by step, and put your final answer within \boxed{}."
- "{task}"
- 50
- 1
- "pipelinerl.domains.math.load_datasets"
- "actor"
- true
- null
- false
- "deepspeed_stage3_bf16"
- 128
- "pipelinerl.domains.math.MathEnvironment"
- 78,000
- [] 0 items
- 1
- "flash_attention_2"
- false
- "/mnt/llmd/base_models/Qwen2.5-0.5B"
- true
- null
- 128
- "tapeagents.finetune.eval.dummy_eval_callback"
- ""
- true
- 512
- true
- 0.3
- "training_data"
- -1
- true
- 0.000001
- true
- 1
- 16
- false
- false
- "none"
- 7,777
- 0
- 0
- 1
Summary
Summary metrics are your model's outputs.
- {} 45 keys
- 0
- 0
- 0
- 0
- 478
- 3
- 4.1094818115234375
- 0
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1,329
- 0
- 0
- 532
- 0
- 0
- 1
- 1
- 1
- 794
- 0
- 0
- 80
- 0
- 0
- 1,329
- 1,116.456038770662
- 0
- 0
- 0
- 532
- 128.4371061274513
- 478
- 0
- 0
- 1.0710232332175724
- 0
- 0
- 532.8619978427887
- 1,244.8931448981132
Artifact Outputs
This run produced these artifacts as outputs. Total: 1.