Amr-amr's group workspace
285M
What makes this group special?
Tags
1028-rmsnorm-285m
Notes
Tags
Main Results (OLMo)
State
Finished
Start time
November 2nd, 2024 8:36:11 PM
Runtime
9h 23m 9s
Tracked hours
9h 23m 8s
Run path
amr-amr/zsl/ng3zi0sb
OS
Linux-5.15.0-101-generic-x86_64-with-glibc2.35
Python version
3.10.14
Git repository
git clone git@github.com:mirandrom/zsl-olmo.git
Git state
git checkout -b "1028-rmsnorm-285m" a5d85f50b6398187a0f97353c8dff998342638b3
Command
/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py /home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/285m.yaml --run_name=1028-rmsnorm-285m --wandb.name=1028-rmsnorm-285m --wandb.group=1028-rmsnorm-285m --wandb.project=zsl-test --wandb.entity=amr-amr
System Hardware
CPU count | 48 |
Logical CPU count | 48 |
GPU count | 2 |
GPU type | [NVIDIA L40S, NVIDIA L40S] |
W&B CLI Version
0.18.1
Group
285MConfig
Config parameters are your model's inputs. Learn more
- {} 64 keys▶
- "fine_grained"
- 0.0001
- 50
- {} 3 keys▶
- "inductor"
- false
- "default"
- 1
- {} 15 keys▶
- null
- 16
- 32
- 1
- 32
- "fsdp"
- false
- null
- null
- 1,000
- false
- -1
- [] 0 items
- 10
- null
- false
- {} 5 keys▶
- false
- 1
- 512
- null
- null
- null
- 262,144
- 1
- null
- {} 43 keys▶
- null
- null
- false
- {} 11 keys▶
- "amp_bf16"
- false
- null
- false
- false
- true
- "1028-rmsnorm-285m"
- true
- "/network/scratch/m/mirceara/zsl-olmo/out/1028-rmsnorm-285m"
- null
- {} 2 keys▶
- "tokenizers/allenai_eleuther-ai-gpt-neox-20b-pii-special.json"
- "right"
- false
- true
46 ... 59▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 1052 keys▶
- 0
- 0.00000000124075749675
- 0.00000797734355728608
- 0.00000000000327517354
- 0.00003849484710372053
- 0.00000000248471376807
- 0.00000684986480337102
- 0.00000000002091148088
- 0.00004172179978922941
- 0.00000000054179766229
- 0.00000404886623073253
- 0.00000000000384256906
- 0.00000768183781474363
- 0.00000000104010233759
- 0.00003091021062573418
- 0.00000000000025768192
- 0.0000691653840476647
- 0.00000000097136720889
- 0.00005671779945259914
- 0.00000000000064649306
- 0.0001834071590565145
- 0.00000000019065145296
- 0.00000033353191497554
- 0.00000000001444518116
- 0.00000127944497307908
- 0.00000000018286858139
- 0.00000062301023717737
- 0.00000000001192923017
- 0.00000095932284693845
- 0.00000000057512383744
- 0.00001250297373189824
- 0.00000000000148823586
- 0.00003772940544877201
- 0.00000000100016050997
- 0.00010575064516160636
- 0.0000000000011034692
- 0.0002198048168793321
- 0.00000000012454764997
- 0.00000060745816199415
- 0.0000000000167108688
- 0.00000123838196941506
- 0.00000000112060360991
- 0.00000451339747087332
- 0.00000000002819950523
- 0.00003103411290794611
- 0.00000000036945391191
- 54,862,691,626.57835
- 24.7280993848064
- 2.908607482910156
- 18.331254200909473
46 ... 95▶▶96 ... 145▶▶146 ... 195▶▶196 ... 245▶▶246 ... 295▶▶296 ... 345▶▶346 ... 395▶▶396 ... 445▶▶446 ... 495▶▶496 ... 545▶▶546 ... 595▶▶596 ... 645▶▶646 ... 695▶▶696 ... 745▶▶746 ... 795▶▶796 ... 845▶▶846 ... 895▶▶896 ... 945▶▶946 ... 995▶▶996 ... 1045▶▶1046 ... 1047▶▶
Artifact Inputs
This run consumed these artifacts as inputs. Learn more
Type
Name
Consumer count
No rows found
Loading...
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...