Kastan's group workspace
Jul-27__21:25
What makes this group special?
Tags
gpt2_8b_2p5d_256
Notes
Tags
my_first_tag
Author
State
Crashed
Start time
July 28th, 2022 2:26:02 AM
Runtime
3h 58m 28s
Tracked hours
-
Run path
kastan/col_ai/3pb1cnzl
OS
Linux-4.18.0-305.49.1.el8_4.x86_64-x86_64-with-glibc2.28
Python version
3.9.12
Command
/u/kastanday/LLM-Distributed-Quantization/benchmarks/gpt/v2_train.py --config /u/kastanday/LLM-Distributed-Quantization/benchmarks/gpt/configs/gpt2_8b_2p5d_256.py --host gpub078 --port 29500 --world_size 32 --rank 10
System Hardware
| CPU count | 64 |
| GPU count | 4 |
| GPU type | NVIDIA A40 |
W&B CLI Version
0.12.21
Group
Jul-27__21:25Config
Config parameters are your model's inputs. Learn more
- {} 22 keys▶
- 1,280
- 1
- "/u/kastanday/LLM-Distributed-Quantization/datasets/small-gpt-dataset.json"
- {} 1 key▶
- "AMP_TYPE.NAIVE"
- "titans.model.gpt.gpt.gpt2_8B"
- "titans.model.gpt.gpt.gpt2_xl"
- 1
- 0.00015
- "./gpt2_2.5d_tp16_bs1280_lr0.00015_accum1_clip_grad1.0/"
- {} 1 key▶
- "titans.loss.lm_loss.gpt_lmloss.GPTLMLoss"
- {} 5 keys▶
- 60
- 8
- {} 2 keys▶
- 0.00015
- 0.01
- {} 2 keys▶
- 2
- {} 3 keys▶
- 1
- "2.5d"
- 16
- 1,024
- "2.5d"
- 16
- 1,280
- 50,304
- 21
- 0.01
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...