Apiche's group workspace
debug_swe_aug_15
debug_swe_aug_15/actor
State
Crashed
Start time
August 15th, 2025 5:05:11 PM
Runtime
11m 10s
Tracked hours
1m 54s
Run path
apiche/pipeline-rl/debug_swe_aug_15_actor
OS
Linux-5.15.0-1067-nvidia-x86_64-with-glibc2.39
Python version
CPython 3.11.11
Git repository
git clone git@github.com:ServiceNow/PipelineRL-SWE.git
Git state
git checkout -b "debug_swe_aug_15/actor" b6f797a67602e10cd6becdb5ec53d6cbfc99c38f
Command
-m pipelinerl.entrypoints.run_actor --config-dir results/debug_swe_aug_15/conf --config-name exp_config output_dir=results/debug_swe_aug_15 hydra.run.dir=results/debug_swe_aug_15/actor +me.llm_urls=http://localhost:8080
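The command above mixes launcher flags with Hydra-style `key=value` overrides. As a minimal sketch (not part of PipelineRL), the overrides can be separated from the flags with plain Python. The helper name `split_overrides` is hypothetical, and the code assumes every `-`-prefixed flag in this command takes exactly one argument, which holds for `-m`, `--config-dir`, and `--config-name` here. In Hydra's override syntax a leading `+` marks a key that is being added to the config rather than overriding an existing one.

```python
import shlex

# Command string copied verbatim from the run overview.
COMMAND = (
    "-m pipelinerl.entrypoints.run_actor "
    "--config-dir results/debug_swe_aug_15/conf "
    "--config-name exp_config "
    "output_dir=results/debug_swe_aug_15 "
    "hydra.run.dir=results/debug_swe_aug_15/actor "
    "+me.llm_urls=http://localhost:8080"
)


def split_overrides(command: str) -> dict[str, str]:
    """Return the key=value overrides, skipping flags and their arguments.

    Hypothetical helper: assumes each flag starting with '-' is followed
    by exactly one argument.
    """
    overrides: dict[str, str] = {}
    skip_next = False
    for token in shlex.split(command):
        if skip_next:
            # This token is the argument of the preceding flag.
            skip_next = False
            continue
        if token.startswith("-"):
            skip_next = True
            continue
        if "=" in token:
            # Split on the first '=' only, so URLs in values stay intact.
            key, value = token.split("=", 1)
            overrides[key] = value
    return overrides


if __name__ == "__main__":
    for key, value in split_overrides(COMMAND).items():
        print(f"{key} = {value}")
```

Listing the overrides this way makes it easier to compare them against the config values recorded for the run.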
System Hardware
| Property | Value |
| --- | --- |
| CPU count | 112 |
| Logical CPU count | 224 |
| GPU count | 4 |
| GPU type | NVIDIA H100 80GB HBM3 |
W&B CLI Version
0.19.11
Group
debug_swe_aug_15
Config
Config parameters are your model's inputs.
- {} 182 keys
- null
- 1
- 1
- 10
- 64
- 64
- "pipelinerl.swe.rollouts.generate_unified_swe_rollout"
- 1
- 50,000,000
- "{task}"
- 50
- 15,000
- 1
- "pipelinerl.swe.load_datasets.load_local_swe_dataset"
- "/mnt/llmd/data/swegym/ds"
- "/mnt/llmd/data/swebench_lite/ds"
- "actor"
- true
- null
- false
- "deepspeed_stage3_bf16"
- null
- 0
- [] 0 items
- 1
- "flash_attention_2"
- false
- "Qwen/Qwen2.5-7B-Instruct"
- true
- null
- "tapeagents.finetune.eval.dummy_eval_callback"
- ""
- true
- 513
- true
- 0.3
- "training_data"
- -1
- true
- 0.000001
- true
- 1
- 16
- false
- false
- "none"
- 7,777
- 3
- 0
- 1
Summary
Summary metrics are your model's outputs.
- {} 72 keys
- 0
- 0
- 9
- 15.886434078216553
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 1
- 0
- 0
- 3
- 1,445
- 735.3333333333334
- 186
- 277,033.55555555556
- 0
- 63
- 7,954
- 4,004.3333333333335
- 654
- 9,061,266.888888888
- 27
- 1
- -1
- 0
- 0
- -1
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 87.84336590766907
- 0
- 0
- 0
Artifact Outputs
This run produced these artifacts as outputs. Total: 1.