Kastan's group workspace
gpt2_3d
What makes this group special?
Tags
Notes
Tags
delta
gpt2_8B
Author
State
Finished
Start time
June 25th, 2022 4:11:35 AM
Runtime
4m 1s
Tracked hours
3m 59s
Run path
kastan/col_ai/tvn44a8j
OS
Linux-4.18.0-305.49.1.el8_4.x86_64-x86_64-with-glibc2.17
Python version
3.8.13
Git repository
git clone https://github.com/hpcaitech/ColossalAI-Examples.git
Git state
git checkout -b "gpt2_3d" 45b6e9cc24b57aed4c5cf2c9278277e74cdef541
Command
train_gpt.py --config gpt2_configs/gpt2_3d.py --from_torch
System Hardware
| CPU count | 128 |
| GPU count | 8 |
| GPU type | NVIDIA A100-SXM4-40GB |
W&B CLI Version
0.12.19
Group
gpt2_3dConfig
Config parameters are your model's inputs. Learn more
- {} 23 keys▶
- "nccl"
- 4
- "gpt2_configs/gpt2_3d.py"
- "gpt2_configs/gpt2_3d.py"
- "/u/kastanday/colossal_ai/raw_json_backup/train_data_FINAL.json"
- {} 1 key▶
- "AMP_TYPE.NAIVE"
- false
- true
- "titans.model.gpt.gpt.gpt2_8B"
- "titans.model.gpt.gpt.gpt2_small"
- "titans.model.gpt.gpt.gpt2_xl"
- null
- null
- {} 1 key▶
- "titans.loss.lm_loss.gpt_lmloss.GPTLMLoss"
- {} 2 keys▶
- true
- "titans.model.gpt.gpt.gpt2_xl"
- 60
- {} 3 keys▶
- 0.00015
- "torch.optim.adam.Adam"
- 0.01
- {} 2 keys▶
- 1
- {} 2 keys▶
- "3d"
- 8
- null
- null
- 1,024
- 8
- null
Summary
Summary metrics are your model's outputs. Learn more
- {} 2 keys▶
- "Hi, we got started"
- 7.564544677734375
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...