Amr-amr's group workspace
14M
Tags
1028-rmsnorm-14m
Notes
Tags
Main Results (OLMo)
State
Finished
Start time
October 29th, 2024 5:28:24 PM
Runtime
1h 35m 11s
Tracked hours
1h 35m 9s
Run path
amr-amr/zsl/cnxom6v3
OS
Linux-5.15.0-101-generic-x86_64-with-glibc2.35
Python version
3.10.14
Git repository
git clone git@github.com:mirandrom/zsl-olmo.git
Git state
git checkout -b "1028-rmsnorm-14m" a756ab5509e5fc6cafa09b061cfe042b2d1e9fe4
Command
/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py /home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/14m.yaml --run_name=1028-rmsnorm-14m --wandb.name=1028-rmsnorm-14m --wandb.group=1028-rmsnorm-14m --wandb.project=zsl-test --wandb.entity=amr-amr
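The command above packs the config file path and five `--key=value` overrides into one line. As a reading aid, here is a minimal sketch that splits it into positional arguments and overrides. The `parse_command` helper is hypothetical and written only for illustration; how `train.py` itself parses its arguments is not shown on this page.

```python
import shlex
from typing import Dict, List, Tuple

# Command string copied from the "Command" field above.
COMMAND = (
    "/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py "
    "/home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/14m.yaml "
    "--run_name=1028-rmsnorm-14m --wandb.name=1028-rmsnorm-14m "
    "--wandb.group=1028-rmsnorm-14m --wandb.project=zsl-test "
    "--wandb.entity=amr-amr"
)


def parse_command(command: str) -> Tuple[List[str], Dict[str, str]]:
    """Separate positional arguments from --key=value overrides.

    Hypothetical helper: it only illustrates the structure of the command
    and is not the parser used by the training script.
    """
    positionals: List[str] = []
    overrides: Dict[str, str] = {}
    for token in shlex.split(command):
        if token.startswith("--") and "=" in token:
            # Split on the first "=" only, so values may contain "=".
            key, value = token[2:].split("=", 1)
            overrides[key] = value
        else:
            positionals.append(token)
    return positionals, overrides


positionals, overrides = parse_command(COMMAND)
print("script:", positionals[0])
print("config:", positionals[1])
for key, value in overrides.items():
    print(f"{key} = {value}")
```

Read this way, the run name, W&B run name, and W&B group all share the value `1028-rmsnorm-14m`.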
System Hardware
CPU count | 48
Logical CPU count | 48
GPU count | 2
GPU type | [NVIDIA L40S, NVIDIA L40S]
W&B CLI Version
0.18.1
Group
14M
Config
Config parameters are your model's inputs.
- {} 64 keys▶
- null
- 0.0001
- 50
- {} 3 keys▶
- "inductor"
- false
- "default"
- 1
- {} 15 keys▶
- null
- 16
- 64
- 1
- 64
- "fsdp"
- false
- null
- null
- 1,000
- false
- -1
- [] 0 items
- 10
- null
- false
- {} 5 keys▶
- null
- 1
- 512
- null
- null
- null
- 262,144
- 1
- null
- {} 43 keys▶
- null
- null
- false
- {} 11 keys▶
- "amp_bf16"
- false
- null
- false
- false
- true
- "1028-rmsnorm-14m"
- true
- "/network/scratch/m/mirceara/zsl-olmo/out/1028-rmsnorm-14m"
- null
- {} 2 keys▶
- "tokenizers/allenai_eleuther-ai-gpt-neox-20b-pii-special.json"
- "right"
- false
- true
Summary
Summary metrics are your model's outputs.
- {} 284 keys▶
- 0
- 0.00000037292147681001
- 0.0005581295699812472
- 0.0000000003918363134
- 0.002121459925547242
- 0.00000109987752239249
- 0.0006096184952184558
- 0.00000000638348751636
- 0.003902824828401208
- 0.00000065996698594972
- 0.00010466254025232048
- 0.00000000586207526965
- 0.0007773516117595136
- 0.0000001807002405485
- 0.00032289582304656506
- 0.00000000237784436585
- 0.0005974783562123775
- 0.00000051334831141503
- 0.0009593224385753274
- 0.00000000024961641087
- 0.004392439033836126
- 0.00000026570893396638
- 0.00011448162695160136
- 0.00000000170994018944
- 0.0006802844000048935
- 0.0000010534888588154
- 0.0007303431630134583
- 0.00000000367948160829
- 0.0023922480177134275
- 0.00000026928879037769
- 0.0003551237459760159
- 0.000000000204817871
- 0.0007447461248375475
- 0.00000031111130738282
- 0.0007353554829023778
- 0.00000000006304580469
- 0.002587259281426668
- 0.00000006392586726633
- 0.00001373622126266128
- 0.00000000176528636153
- 0.00008239412272814661
- 0.00000050253561312275
- 0.00019936390162911263
- 0.00000000227157670452
- 0.0007481653010472655
- 0.00000050667694040385
- 2,713,522,284.2025576
- 21.72151336378063
- 3.924501895904541
- 50.62785383587583
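The config and summary values listed on this page appear without their key names, so they are hard to interpret on their own. A minimal sketch of how the keyed values could be retrieved through the W&B public API follows, assuming the `wandb` package is installed, the caller is logged in, and the caller has access to the run named in the "Run path" field. The helper names `split_run_path` and `fetch_config_and_summary` are hypothetical.

```python
from typing import Any, Dict, Tuple

RUN_PATH = "amr-amr/zsl/cnxom6v3"  # "Run path" field from this page


def split_run_path(path: str) -> Tuple[str, str, str]:
    """Split 'entity/project/run_id' into its three parts."""
    parts = path.strip("/").split("/")
    if len(parts) != 3:
        raise ValueError(f"expected entity/project/run_id, got {path!r}")
    return parts[0], parts[1], parts[2]


def fetch_config_and_summary(path: str) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Fetch the keyed config and summary of a run.

    Needs network access and W&B credentials, so it is not called by default.
    """
    import wandb  # lazy import: split_run_path works without wandb installed

    run = wandb.Api().run(path)
    # Drop internal keys that start with an underscore.
    config = {k: v for k, v in run.config.items() if not k.startswith("_")}
    # _json_dict is the attribute used in W&B's own data-export examples.
    summary = dict(run.summary._json_dict)
    return config, summary


if __name__ == "__main__":
    entity, project, run_id = split_run_path(RUN_PATH)
    print(entity, project, run_id)
    # Uncomment to query W&B (requires access to the project):
    # config, summary = fetch_config_and_summary(RUN_PATH)
    # for key in sorted(summary):
    #     print(key, summary[key])
```

Printing the sorted summary keys next to their values would restore the pairing that this export drops.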
Artifact Inputs
This run consumed these artifacts as inputs.
Type
Name
Consumer count
No rows found
Artifact Outputs
This run produced these artifacts as outputs. Total: 1.
Type
Name
Consumer count