
Apiche's group workspace

test_cumulative_time763765

Tags

test_cumulative_time763765/actor

Notes
Author
State
Crashed
Start time
August 7th, 2025 5:45:23 PM
Runtime
20h 34m 28s
Tracked hours
3m 36s
Run path
apiche/pipeline-rl/test_cumulative_time763765_actor
OS
Linux-5.15.0-1067-nvidia-x86_64-with-glibc2.39
Python version
CPython 3.11.13
Git repository
git clone git@github.com:ServiceNow/pipelinerl.git
Git state
git checkout -b "test_cumulative_time763765/actor" b8e17624aa0aa367f58153ee8057bed72425e39f
Command
-m pipelinerl.entrypoints.run_actor --config-dir results/test_cumulative_time763765/conf --config-name exp_config output_dir=results/test_cumulative_time763765 hydra.run.dir=results/test_cumulative_time763765/actor +me.llm_urls=http://localhost:8080+http://localhost:8081
System Hardware
CPU count 112
Logical CPU count 224
GPU count 4
GPU type NVIDIA H100 80GB HBM3
W&B CLI Version
0.20.1
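
The Command field above mixes dashed launcher options with Hydra-style `key=value` overrides. The sketch below separates the two using only the standard library, so the overrides recorded for this run can be inspected programmatically. The helper name `split_command` is hypothetical, and reading the `+`-joined `me.llm_urls` value as a list of two server URLs is an assumption based on its shape, not something this page confirms.

```python
import shlex

# Command string copied verbatim from the "Command" field above.
COMMAND = (
    "-m pipelinerl.entrypoints.run_actor "
    "--config-dir results/test_cumulative_time763765/conf "
    "--config-name exp_config "
    "output_dir=results/test_cumulative_time763765 "
    "hydra.run.dir=results/test_cumulative_time763765/actor "
    "+me.llm_urls=http://localhost:8080+http://localhost:8081"
)


def split_command(command: str):
    """Separate dashed options from key=value overrides (hypothetical helper)."""
    tokens = shlex.split(command)
    options, overrides = {}, {}
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok.startswith("-"):
            # Each dashed option in this command takes the next token as its value.
            options[tok] = tokens[i + 1]
            i += 2
        else:
            # Split on the first '=' only; a leading '+' stays part of the key.
            key, _, value = tok.partition("=")
            overrides[key] = value
            i += 1
    return options, overrides


options, overrides = split_command(COMMAND)

# Assumption: the '+'-joined value lists one LLM server URL per entry.
llm_urls = overrides["+me.llm_urls"].split("+")

print(options["-m"])
print(llm_urls)
```

In Hydra's override syntax, a leading `+` on a key adds an entry that is not already present in the config, which is why `me.llm_urls` is written as `+me.llm_urls` here.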
Config

Config parameters are your model's inputs.

  • {} 169 keys
    • null
    • 1
    • 64
    • 0
    • 64
    • 64
    • "pipelinerl.domains.math.generate_math_rollout"
    • 1
    • 10,000,000
    • "Please reason step by step, and put your final answer within \boxed{}."
    • "{task}"
    • 50
    • 1
    • "pipelinerl.domains.math.load_datasets"
    • "actor"
    • true
    • null
    • false
    • "deepspeed_stage3_bf16"
    • "pipelinerl.domains.math.MathEnvironment"
    • 78,000
    • [] 0 items
      • 1
      • "flash_attention_2"
      • false
      • "Qwen/Qwen2.5-1.5B-Instruct"
      • true
      • null
      • "tapeagents.finetune.eval.dummy_eval_callback"
      • ""
      • true
      • 512
      • true
      • 0.3
      • "training_data"
      • -1
      • true
      • 0.000001
      • true
      • 1
      • 16
      • false
      • false
      • "none"
      • 0.05
      • false
      • 46 ... 95
        96 ... 145
        146 ... 164
      • 7,777
      • 4
      • 0
      • 1
    Summary

    Summary metrics are your model's outputs.

    • {} 41 keys
      • 1
      • 512
      • 32.498846769332886
      • 0
      • 0
      • 0
      • 1
      • 1
      • 0
      • 1
      • 1
      • 2,839
      • 0
      • 0
      • 94
      • 1
      • 1
      • 0
      • 1
      • 1
      • 1,339
      • 0
      • 0
      • 74
      • 1
      • 1
      • 2,839
      • 9,372.98867525801
      • 0
      • 0
      • 0
      • 94
      • 1,026.1205763769249
      • 512
      • 0
      • 1
      • 9.267707517855174
      • 0
      • 1
      • 47.02703905105591
      • 10,399.109251634934
    Artifact Outputs

    This run produced these artifacts as outputs. Total: 1.