Kastan's group workspace
gpt2_zero3
What makes this group special?
Tags
Notes
Author
State
Failed
Start time
June 22nd, 2022 4:32:19 PM
Runtime
39s
Tracked hours
36s
Run path
kastan/col_ai/u34l1jlo
OS
Linux-4.18.0-348.23.1.el8_5.x86_64-x86_64-with-glibc2.10
Python version
3.8.2
Git repository
git clone https://github.com/hpcaitech/ColossalAI-Examples.git
Git state
git checkout -b "gpt2_zero3" 0837bc67d4d7032c6c29f18339aef6dff6c7b861
Command
train_gpt.py --config gpt2_configs/gpt2_zero3.py --from_torch
System Hardware
| CPU count | 128 |
| GPU count | 4 |
| GPU type | NVIDIA A100-SXM4-80GB |
W&B CLI Version
0.12.18
Group
gpt2_zero3Config
Config parameters are your model's inputs. Learn more
- {} 17 keys▶
- "nccl"
- 2
- "gpt2_configs/gpt2_zero3.py"
- "gpt2_configs/gpt2_zero3.py"
- "/u/kastan/colossal/data/train_data_FINAL.json"
- true
- "titans.model.gpt.gpt.gpt2_small"
- null
- null
- {} 2 keys▶
- true
- "titans.model.gpt.gpt.gpt2_small"
- 60
- {} 3 keys▶
- 0.00015
- "colossalai.nn.optimizer.hybrid_adam.HybridAdam"
- 0.01
- null
- null
- 1,024
- null
- {} 2 keys▶
- {} 3 keys▶
- true
- "<colossalai.zero.shard_utils.tensor_shard_strategy.TensorShardStrategy object at 0x150e7506b340>"
- "cpu"
- {} 0 keys
Summary
Summary metrics are your model's outputs. Learn more
- {} 5 keys▶
- "Hi, we got started"
- 82.625
- 82.625
- 61.968
- 14.367
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...