Amr-amr's group workspace
472M
What makes this group special?
Tags
1028-rmsnorm-472m.6515011
Notes
Tags
Main Results (OLMo)
Author
State
Finished
Start time
April 3rd, 2025 3:28:01 AM
Runtime
1d 20h 3m 59s
Tracked hours
-
Run path
amr-amr/zsl/0dvyk2gc
OS
Linux-5.15.0-101-generic-x86_64-with-glibc2.35
Python version
CPython 3.10.14
Git repository
git clone git@github.com:mirandrom/zsl-olmo.git
Git state
git checkout -b "1028-rmsnorm-472m.6515011" e5db42f54d08fb0b5412ee6268cd20471c7a7467
Command
/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py /home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/472m.yaml --run_name=1028-rmsnorm-472m --wandb.name=1028-rmsnorm-472m.6515011 --wandb.group=1028-rmsnorm-472m --wandb.project=zsl-test --wandb.entity=amr-amr
System Hardware
CPU count | 48 |
Logical CPU count | 48 |
GPU count | 2 |
GPU type | NVIDIA L40S |
W&B CLI Version
0.19.8
Group
472MConfig
Config parameters are your model's inputs. Learn more
- {} 64 keys▶
- "fine_grained"
- 0.0001
- 50
- {} 3 keys▶
- "inductor"
- false
- "default"
- 1
- {} 16 keys▶
- null
- 16
- 32
- 1
- 32
- "fsdp"
- false
- null
- null
- 1,000
- false
- -1
- [] 0 items
- 10
- null
- false
- {} 5 keys▶
- false
- 1
- 512
- null
- null
- null
- 262,144
- 1
- null
- {} 43 keys▶
- null
- null
- false
- {} 11 keys▶
- "amp_bf16"
- false
- null
- false
- false
- true
- "1028-rmsnorm-472m"
- true
- "/network/scratch/m/mirceara/zsl-olmo/out/1028-rmsnorm-472m"
- null
- {} 2 keys▶
- "tokenizers/allenai_eleuther-ai-gpt-neox-20b-pii-special.json"
- "right"
- false
- true
46 ... 59▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 1052 keys▶
- 0
- 0.00000000023884794143
- 0.00000143017314258032
- 0.00000000000211669571
- 0.00001275107206311077
- 0.00000000060622168396
- 0.00000167225766745105
- 0.00000000001333429027
- 0.00001206663364428096
- 0.0000000001437746866
- 0.00000009938728595671
- 0.00000000000067348357
- 0.00000035226196359872
- 0.00000000025963503569
- 0.000014813716916251
- 0.00000000000040103871
- 0.00003648872734629549
- 0.00000000045482997924
- 0.00007205313886515796
- 0.00000000000037520933
- 0.00017947613378055394
- 0.00000000008024005765
- 0.00000007760765896592
- 0.00000000000592391414
- 0.00000045021175765214
- 0.00000000105910558101
- 0.00000605906052442151
- 0.00000000001940792839
- 0.00004622575579560362
- 0.00000000018598234064
- 0.00000935277421376668
- 0.00000000000460230595
- 0.0000203907438844908
- 0.0000000003436908258
- 0.00001662587055761833
- 0.00000000000007755347
- 0.00007962677045725286
- 0.00000000005725919391
- 0.00000005789939194756
- 0.00000000000753307052
- 0.00000020600243999525
- 0.00000000033273184208
- 0.00000152882000747923
- 0.00000000001946807819
- 0.00001118051386583829
- 0.00000000017726416168
- 77,906,238,128.02629
- 25.07877186528228
- 2.8206686973571777
- 16.78807305961834
46 ... 95▶▶96 ... 145▶▶146 ... 195▶▶196 ... 245▶▶246 ... 295▶▶296 ... 345▶▶346 ... 395▶▶396 ... 445▶▶446 ... 495▶▶496 ... 545▶▶546 ... 595▶▶596 ... 645▶▶646 ... 695▶▶696 ... 745▶▶746 ... 795▶▶796 ... 845▶▶846 ... 895▶▶896 ... 945▶▶946 ... 995▶▶996 ... 1045▶▶1046 ... 1047▶▶
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...