Upup-ashton-wang's group workspace
Tina-OpenThoughts
Tags
Tina-OpenThoughts-continue4
Notes
Author
State
Killed
Start time
April 2nd, 2025 5:50:00 PM
Runtime
14h 46m 54s
Tracked hours
14h 46m 52s
Run path
upup-ashton-wang-usc/Tina/37ymbxtt
OS
Linux-4.18.0-553.22.1.el8_10.x86_64-x86_64-with-glibc2.28
Python version
CPython 3.10.16
Command
/home1/shangsha/workspace/reasoning/reasoning-sae/./resee/post_train_hf/grpo.py --config ./recipes/DeepSeek-R1-Distill-Qwen-1.5B/grpo/model_curated_thoughts.yaml
System Hardware
CPU count | 32 |
Logical CPU count | 32 |
GPU count | 2 |
GPU type | NVIDIA A40 |
W&B CLI Version
0.19.8
Group
Tina-OpenThoughts
Config
Config parameters are your model's inputs.
- {} 224 keys▶
- true
- "/project/neiswang_1391/shangsha/reasoning/reasoning-sae/ckpts/models/DeepSeek-R1-Distill-Qwen-1.5B/base"
- {} 6 keys▶
- false
- 0.9
- 0.999
- 0.00000001
- false
- [] 1 item▶
- "Qwen2ForCausalLM"
- 0
- false
- false
- null
- false
- null
- 0.04
- true
- false
- 151,643
- 0
- null
- null
- false
- 0
- false
- true
- null
- null
- null
- null
- null
- 1,800
- [] 0 items
- null
- null
- false
- null
- 0
- false
- false
- false
- false
- true
- false
- 0
- 151,643
- 151,936
- 0
- 200
- 0
Summary
Summary metrics are your model's outputs.
- {} 17 keys▶
- "table-file"
- 1,590.890625
- 0.6057267389067135
- 5,003
- 0.9394959211349488
- 0.261474609375
- 0.00000041635875908134
- 0.0105
- 2.6940483450889587
- 0.6548616364598274
- 0.25
- -0.04699374921619892
- 0.921875
- -0.08707625139504671
- 0.8281250149011612
- -0.13281909190118313
- 0.9609375
Artifact Outputs
This run produced these artifacts as outputs. Total: 6.
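
The config and summary values listed above appear without their keys, so the most dependable way to read them is to load the run by its run path. The following is a minimal sketch, assuming the `wandb` Python package is installed and that the caller has read access to the project. `RunPath`, `parse_run_path`, and `fetch_run` are hypothetical helper names introduced here for illustration; only `wandb.Api().run(...)` and the `config` and `summary` attributes of the returned run come from the W&B public API.

```python
from typing import NamedTuple


class RunPath(NamedTuple):
    """The three components of a W&B run path (hypothetical helper type)."""
    entity: str
    project: str
    run_id: str


def parse_run_path(path: str) -> RunPath:
    """Split 'entity/project/run_id' into its parts, rejecting malformed input."""
    parts = path.strip("/").split("/")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected 'entity/project/run_id', got {path!r}")
    return RunPath(*parts)


def fetch_run(path: str):
    """Load a run through the W&B public API.

    Needs network access and credentials that can read the project.
    """
    import wandb  # imported lazily so parse_run_path works without wandb installed

    parse_run_path(path)  # fail early on a malformed path
    return wandb.Api().run(path)


# Run path as shown on this page.
print(parse_run_path("upup-ashton-wang-usc/Tina/37ymbxtt"))
```

With access to the project, `run = fetch_run("upup-ashton-wang-usc/Tina/37ymbxtt")` should return a run object whose `run.config` and `run.summary` hold the keyed versions of the Config and Summary values.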