Igoro's group workspace
20B_eval
State
Crashed
Start time
November 24th, 2021 2:30:18 AM
Runtime
20h 39m 29s
Tracked hours
-
Run path
eleutherai/gpt-thicc/1pg2z8jn
OS
Linux-5.11.0-40-generic-x86_64-with-glibc2.10
Python version
3.8.5
Git repository
git clone https://github.com/EleutherAI/pyfra_scripts.git
Git state
git checkout -b "20B_eval" 36bdc8af77fc204c06e375cc834929a590bed8f3
Command
train_model.py
System Hardware
| Hardware | Value |
| --- | --- |
| CPU count | 12 |
| GPU count | 1 |
| GPU type | GeForce RTX 3080 |
W&B CLI Version
0.12.1
Group
20B_eval
Config
Config parameters are your model's inputs.
- {} 53 keys
- 0
- true
- true
- 1
- "mmap"
- "/mnt/ssd-1/data/pile_20B_tokenizer/pile_20B_tokenizer_text_document"
- "nccl"
- 1,000
- 10
- {} 7 keys
- true
- 32
- 1
- 0
- 6,144
- "small_init"
- "/mnt/ssd-1/20B_checkpoints"
- "/mnt/ssd-1/logs"
- 2
- 150,000
- "cosine"
- 2,048
- 0.0000097
- 2
- true
- "layernorm"
- 64
- 44
- {} 2 keys
- {} 3 keys
- [] 2 items
- 0.9
- 0.95
- 0.00000001
- 0.000097
- "Adam"
- "wang_init"
- "column"
- false
- 4
- "rotary"
- 0.25
- "/mnt/ssd-1/20B_checkpoints"
- 1,000
- true
- 2,048
- "995,4,1"
- 2
- true
- "/mnt/ssd-1/tensorboard"
- "HFTokenizer"
- 4
- 150,000
- "eleutherai"
- 0.01
- 0.01
- {} 8 keys
Summary
Summary metrics are your model's outputs.
No summary metrics saved for this run.
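The run path listed above (`eleutherai/gpt-thicc/1pg2z8jn`) follows the W&B `entity/project/run_id` layout. The config values on this page appear without their key names and no summary metrics were saved, so one way to recover the keyed config would be the W&B public API. The following is a minimal sketch, not the method used for this run. The helper names `split_run_path` and `fetch_run_config` are hypothetical, and it assumes the `wandb` package is installed and that the run is still accessible to the caller.

```python
from typing import NamedTuple


class RunPath(NamedTuple):
    entity: str
    project: str
    run_id: str


def split_run_path(path: str) -> RunPath:
    """Split a W&B run path of the form entity/project/run_id."""
    parts = path.strip("/").split("/")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected 'entity/project/run_id', got {path!r}")
    return RunPath(*parts)


def fetch_run_config(path: str) -> dict:
    """Fetch the keyed config for a run through the W&B public API.

    Assumption: `wandb` is installed, the caller is logged in (or the
    project is public), and the run still exists. None of this is
    confirmed by the page itself.
    """
    import wandb  # imported lazily so split_run_path works without wandb

    split_run_path(path)  # validate the shape before any network call
    run = wandb.Api().run(path)
    return dict(run.config)


if __name__ == "__main__":
    # Only the offline helper is exercised here; no network access needed.
    print(split_run_path("eleutherai/gpt-thicc/1pg2z8jn"))
```

Calling `fetch_run_config("eleutherai/gpt-thicc/1pg2z8jn")` would, under those assumptions, return the config as a dictionary with its key names, which this page does not show.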