Skip to main content

Levmckinney's group workspace

pythia-12b-deduped

What makes this group special?
Tags

EleutherAI/pythia-12b-deduped-1683299166

Notes
State
Finished
Start time
May 5th, 2023 3:19:42 PM
Runtime
5h 14m 37s
Tracked hours
5h 14m 29s
Run path
levmckinney/tuned-lens/ovvz4w3c
OS
Linux-5.4.0-148-generic-x86_64-with-glibc2.35
Python version
3.10.6
Command
-m tuned_lens.__main__ train --model.name EleutherAI/pythia-12b-deduped --data.name /datasets/val.jsonl --per_gpu_batch_size=1 --output /output/EleutherAI/pythia-12b-deduped-1683299166 --wandb EleutherAI/pythia-12b-deduped-1683299166 --fsdp
System Hardware
CPU count128
Logical CPU count 255
GPU count4
GPU typeNVIDIA A100-SXM4-80GB
W&B CLI Version
0.15.0
Config

Config parameters are your model's inputs. Learn more

  • {} 17 keys
    • false
    • {} 5 keys
      • {} 2 keys
        • false
        • true
      • null
      • "LossChoice.KL"
      • {} 5 keys
        • 250
        • {} 6 keys
          • "/output/EleutherAI/pythia-12b-deduped-1683299166"
          • 1
          • false
          • 42
          • false
          • null
          • 262,144
          • "EleutherAI/pythia-12b-deduped-1683299166"
          • false
        Summary

        Summary metrics are your model's outputs. Learn more

        • {} 108 keys
          • 0.015914397314190865
          • 0.013314316980540752
          • 0.011105027049779892
          • 0.009572981856763365
          • 0.008649605326354504
          • 0.008007501251995564
          • 0.006929654162377119
          • 0.006142694503068924
          • 0.004967795684933662
          • 0.0043098400346934795
          • 0.003883617464452982
          • 0.0035951549652963877
          • 0.003284243866801262
          • 0.0029783418867737055
          • 0.0026997854001820087
          • 0.002420594450086355
          • 0.002192040905356407
          • 0.0020343526266515255
          • 0.0018384386785328388
          • 0.0017588514601811769
          • 0.0017427551792934537
          • 0.0017666786443442106
          • 0.0017473469488322737
          • 0.0017323915380984545
          • 0.001731615629978478
          • 0.0017512693302705884
          • 0.0016811741515994072
          • 0.0016638393281027677
          • 0.0016301656141877174
          • 0.00157680653501302
          • 0.001570948516018689
          • 0.001520850113593042
          • 0.0014232916291803122
          • 0.001282525947317481
          • 0.001195451826788485
          • 0.775674045085907
          • 2.139585256576538
          • 1.7683042287826538
          • 1.6840815544128418
          • 1.5712485313415527
          • 1.465919852256775
          • 1.392081379890442
          • 1.2736033201217651
          • 1.1397497653961182
          • 1.0946130752563477
          • 1.0028672218322754
          • 46 ... 95
            96 ... 103
          • 0.5218281149864197
          • 0.47037163376808167
          • 0.4201824367046356
          • 1.3950512409210205
        Artifact Inputs

        This run consumed these artifacts as inputs. Learn more

        Artifact Outputs

        This run produced these artifacts as outputs. Learn more

        Loading...