Apiche's group workspace
debug_actor_mcp2
Tags
debug_actor_mcp2/actor
Notes
Author
State
Killed
Start time
August 26th, 2025 7:03:01 PM
Runtime
1h 15m
Tracked hours
1m 25s
Run path
apiche/pipeline-rl/debug_actor_mcp2_actor
OS
Linux-5.15.0-1067-nvidia-x86_64-with-glibc2.39
Python version
CPython 3.11.11
Git repository
git clone git@github.com:ServiceNow/pipelinerl.git
Git state
git checkout -b "debug_actor_mcp2/actor" bb4d0c59128900a7f723ba0b9edd1ed7c860ddda
Command
-m pipelinerl.entrypoints.run_actor --config-dir results/debug_actor_mcp2/conf --config-name exp_config output_dir=results/debug_actor_mcp2 hydra.run.dir=results/debug_actor_mcp2/actor +me.llm_urls=http://localhost:8080
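The recorded command mixes interpreter and Hydra flags (`-m`, `--config-dir`, `--config-name`) with Hydra-style `key=value` overrides, where a leading `+` adds a key that is not already in the config. The sketch below separates the two groups so the overrides are easier to read. The helper name `split_command` is hypothetical and is not part of pipelinerl or Hydra. It assumes every dash-prefixed token takes exactly one value, which holds for this command but not for command lines in general.

```python
import shlex

# Command string copied verbatim from the run's "Command" field.
COMMAND = (
    "-m pipelinerl.entrypoints.run_actor "
    "--config-dir results/debug_actor_mcp2/conf "
    "--config-name exp_config "
    "output_dir=results/debug_actor_mcp2 "
    "hydra.run.dir=results/debug_actor_mcp2/actor "
    "+me.llm_urls=http://localhost:8080"
)


def split_command(command: str) -> tuple[dict[str, str], dict[str, str]]:
    """Split a recorded command into (flags, overrides).

    Hypothetical helper for illustration only. Assumes each token that
    starts with "-" is followed by exactly one value.
    """
    tokens = shlex.split(command)
    flags: dict[str, str] = {}
    overrides: dict[str, str] = {}
    i = 0
    while i < len(tokens):
        token = tokens[i]
        if token.startswith("-") and i + 1 < len(tokens):
            # Flag plus its single value, e.g. "--config-name exp_config".
            flags[token] = tokens[i + 1]
            i += 2
        elif "=" in token:
            # key=value override; split on the first "=" only so that
            # values containing "=" or ":" (such as URLs) stay intact.
            key, value = token.split("=", 1)
            overrides[key] = value
            i += 1
        else:
            # Anything else is ignored in this sketch.
            i += 1
    return flags, overrides


flags, overrides = split_command(COMMAND)
for key, value in overrides.items():
    print(f"{key} -> {value}")
```

For this run the overrides are `output_dir`, `hydra.run.dir`, and `+me.llm_urls`. The first two point at directories under `results/debug_actor_mcp2`, and the third points at `http://localhost:8080`.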
System Hardware
| Property | Value |
| --- | --- |
| CPU count | 112 |
| Logical CPU count | 224 |
| GPU count | 1 |
| GPU type | NVIDIA H100 80GB HBM3 |
W&B CLI Version
0.19.11
Group
debug_actor_mcp2
Config
Config parameters are your model's inputs.
- {} 197 keys
- null
- 1
- 64
- 0
- 64
- 64
- "pipelinerl.domains.mcp.generate_mcp_rollout"
- 1
- 10,000,000
- "Please reason step by step, and put your final answer within \boxed{}."
- "{task}"
- 50
- 3
- "tapeagents.agent.Agent"
- 3
- "mcp_agent"
- [] 5 items
- true
- "You have access to the following tools: {tools_description} "
- "You have access to the following tools: {tools_description} "
- "Output only a single JSON dict. Do not repeat the last thought again. If the last action does not change the observation, do not repeat it! DO NOT OUTPUT ANYTHING BESIDES THE JSON! DO NOT PLACE ANY COMMENTS INSIDE THE JSON. It will break the system that processes the output. "
- "You are an expert AI Agent trained to assist users with complex information processing tasks. Your role is to understand user queries and respond in a helpful and accurate manner. Keep your replies concise and direct. Prioritize clarity and avoid over-elaboration. Do not express emotions or opinions about user questions. "
- "Important! Respond with the plain text, do not include any JSON or code. Do not output anything besides what I asked in this message. "
- 1
- "pipelinerl.domains.math.load_datasets"
- "actor"
- true
- null
- false
- "deepspeed_stage3_bf16"
- "pipelinerl.domains.mcp.MCPEnvironmentServer"
- 600
- "results/debug_actor_mcp2/env_server"
- "0.0.0.0"
- "pipelinerl.domains.math.MathEnvironment"
- "/home/toolkit/research-now-reasoner/pipelinerl/conf/mcp/python.json"
- 3,000
- "tapeagents.mcp.MCPEnvironment"
- [] 1 item
- "run_python_code"
- 32
- 1
- 7
- 78,000
- [] 0 items
- 1
- "flash_attention_2"
- 7,777
- 4
- 0
- 1