Amr-amr's group workspace

7M

1028-rmsnorm-7m

Notes
Tags
Main Results (OLMo)
Author
State
Finished
Start time
October 29th, 2024 9:06:14 PM
Runtime
58m 42s
Tracked hours
58m 40s
Run path
amr-amr/zsl/gaeywe8o
OS
Linux-5.15.0-101-generic-x86_64-with-glibc2.35
Python version
3.10.14
Git repository
git clone git@github.com:mirandrom/zsl-olmo.git
Git state
git checkout -b "1028-rmsnorm-7m" a756ab5509e5fc6cafa09b061cfe042b2d1e9fe4
Command
/home/mila/m/mirceara/proj/zsl-olmo/scripts/train.py /home/mila/m/mirceara/proj/zsl-olmo/zsl/exp/1028-rmsnorm/7m.yaml --run_name=1028-rmsnorm-7m --wandb.group=1028-rmsnorm-7m --wandb.name=1028-rmsnorm-7m --wandb.project=zsl-tmp --wandb.entity=amr-amr
System Hardware
CPU count: 192
Logical CPU count: 192
GPU count: 4
GPU type: [NVIDIA H100 80GB HBM3, NVIDIA H100 80GB HBM3, NVIDIA H100 80GB HBM3, NVIDIA H100 80GB HBM3]
W&B CLI Version
0.18.1
Group
7M
Config

Config parameters are your model's inputs.

  • {} 64 keys
    • null
    • 0.0001
    • 50
    • {} 3 keys
      • "inductor"
      • false
      • "default"
    • 1
    • {} 15 keys
      • null
      • 16
      • 128
      • 1
      • 128
      • "fsdp"
      • false
      • null
      • null
      • 1,000
      • false
      • -1
      • [] 0 items
        • 10
        • null
        • false
        • {} 5 keys
          • null
          • 1
          • 512
          • null
          • null
          • null
          • 262,144
          • 1
          • null
          • {} 43 keys
            • null
            • null
            • false
            • {} 11 keys
              • "amp_bf16"
              • false
              • null
              • false
              • false
              • true
              • "1028-rmsnorm-7m"
              • true
              • "/network/scratch/m/mirceara/zsl-olmo/out/1028-rmsnorm-7m"
              • null
              • {} 2 keys
                • "tokenizers/allenai_eleuther-ai-gpt-neox-20b-pii-special.json"
                • "right"
              • false
              • true
Summary

Summary metrics are your model's outputs.

            • {} 156 keys
              • 1
              • 0.00000201568332158786
              • 0.00033450601040385664
              • 0.00000000099824104538
              • 0.0019145041005685923
              • 0.00000498654480907135
              • 0.000679589225910604
              • 0.00000001433296148434
              • 0.004208014812320471
              • 0.00002593459066702053
              • 0.00398057047277689
              • 0.00000003872142073647
              • 0.016494186595082283
              • 0.00001314395376539324
              • 0.004839143715798855
              • 0.00000001013652362047
              • 0.010401347652077677
              • 0.0000009919713193085
              • 0.002857089275494218
              • 0.00000000065902761026
              • 0.003751809010282159
              • 0.00000138890209200326
              • 0.0001157048172899522
              • 0.00000000454635573632
              • 0.0006920762825757265
              • 0.00000779394213168416
              • 0.0011906663421541452
              • 0.00000001436517216291
              • 0.004090721718966961
              • 0.00001319427610724233
              • 0.002406771993264556
              • 0.00000000126285049085
              • 0.00971877109259367
              • 0.00000000045671977311
              • 0.00006489831139333546
              • 0.00000000000000000008
              • 0.00014828110579401255
              • -0.00000055249012120839
              • 0.010779142379760742
              • 0.00000000066689775924
              • 0.12801271677017212
              • -0.00000704967487763497
              • 0.0169209111481905
              • 0.00000000574509417817
              • 0.12459757924079896
              • -0.00000662631009618053
              • 1,342,123,884.966576
              • 21.017519184921227
              • 4.443849086761475
              • 85.10187657629092