Amr-amr's group workspace
78M
What makes this group special?
Tags
1028-rmsnorm-78m
Notes
Tags
Main Results (OLMo)
Author
State
Finished
Start time
October 29th, 2024 9:53:26 PM
Runtime
8h 11m 11s
Tracked hours
8h 11m 10s
Run path
amr-amr/zsl/7t13yq2u
OS
Linux-5.15.0-101-generic-x86_64-with-glibc2.35
Python version
3.10.14
Git repository
git clone git@github.com:mirandrom/zsl-olmo.git
Git state
git checkout -b "1028-rmsnorm-78m" a756ab5509e5fc6cafa09b061cfe042b2d1e9fe4
Command
/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py /home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/78m.yaml --run_name=1028-rmsnorm-78m --wandb.name=1028-rmsnorm-78m --wandb.group=1028-rmsnorm-78m --wandb.project=zsl-test --wandb.entity=amr-amr
System Hardware
CPU count | 48 |
Logical CPU count | 48 |
GPU count | 2 |
GPU type | [NVIDIA L40S, NVIDIA L40S] |
W&B CLI Version
0.18.1
Group
78MConfig
Config parameters are your model's inputs. Learn more
- {} 64 keys▶
- null
- 0.0001
- 50
- {} 3 keys▶
- "inductor"
- false
- "default"
- 1
- {} 15 keys▶
- null
- 16
- 32
- 1
- 32
- "fsdp"
- false
- null
- null
- 1,000
- false
- -1
- [] 0 items
- 10
- null
- false
- {} 5 keys▶
- null
- 1
- 512
- null
- null
- null
- 262,144
- 1
- null
- {} 43 keys▶
- null
- null
- false
- {} 11 keys▶
- "amp_bf16"
- false
- null
- false
- false
- true
- "1028-rmsnorm-78m"
- true
- "/network/scratch/m/mirceara/zsl-olmo/out/1028-rmsnorm-78m"
- null
- {} 2 keys▶
- "tokenizers/allenai_eleuther-ai-gpt-neox-20b-pii-special.json"
- "right"
- false
- true
46 ... 59▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 796 keys▶
- 0
- 0.0000000042812113854
- 0.00001729361974867061
- 0.00000000000969423153
- 0.00012260410585440695
- 0.0000000082338509344
- 0.00000583794553676853
- 0.00000000010502651526
- 0.00004766496931551955
- 0.00000000186995552376
- 0.00000147871105582453
- 0.00000000012757293832
- 0.00000433032573710079
- 0.00000000171958136619
- 0.0000225899839279009
- 0.00000000004731354197
- 0.00005257746306597255
- 0.00000000588923487754
- 0.00010391221439931542
- 0.0000000000075972284
- 0.0004308274365030229
- 0.00000000058071242259
- 0.00000028970654852856
- 0.00000000004596043685
- 0.0000011099323273811
- 0.00000000065398819693
- 0.0000001208015589782
- 0.00000000010687709051
- 0.00000064599282723066
- 0.00000000177133585577
- 0.000090082197857555
- 0.00000000004575950729
- 0.000101468940556515
- 0.00000000745202033414
- 0.0001638083631405607
- 0.00000000000149296894
- 0.0005756734753958881
- 0.00000000059414101417
- 0.00000042839280922635
- 0.00000000005508141859
- 0.00000124257860534271
- 0.00000000941686906231
- 0.00007480040949303657
- 0.0000000001108010142
- 0.00012174207222415134
- 0.00000000200989691557
- 18,230,140,789.84382
- 23.626342148582687
- 3.1946680545806885
- 24.402072191655765
46 ... 95▶▶96 ... 145▶▶146 ... 195▶▶196 ... 245▶▶246 ... 295▶▶296 ... 345▶▶346 ... 395▶▶396 ... 445▶▶446 ... 495▶▶496 ... 545▶▶546 ... 595▶▶596 ... 645▶▶646 ... 695▶▶696 ... 745▶▶746 ... 791▶▶
Artifact Inputs
This run consumed these artifacts as inputs. Learn more
Type
Name
Consumer count
No rows found
Loading...
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...