Skip to main content

Levmckinney's group workspace

pythia-410m-deduped

What makes this group special?
Tags

pythia-160m-deduped-single-gpu

Notes
State
Finished
Start time
April 29th, 2023 3:49:23 AM
Runtime
8h 23m 35s
Tracked hours
8h 23m 30s
Run path
levmckinney/tuned-lens/a7wmlqz2
OS
Linux-5.15.0-69-generic-x86_64-with-glibc2.35
Python version
3.10.6
Git repository
git clone git@github.com:AlignmentResearch/tuned-lens.git
Git state
git checkout -b "pythia-160m-deduped-single-gpu" 433b352998e753a22ec0f1b0dcdc9752213c2fda
Command
-m tuned_lens.__main__ train --model.name EleutherAI/pythia-410m-deduped --data.name val.jsonl --per_gpu_batch_size=2 --wandb pythia-160m-deduped-single-gpu
System Hardware
CPU count30
Logical CPU count 30
GPU count1
GPU typeNVIDIA A10
W&B CLI Version
0.15.0
Config

Config parameters are your model's inputs. Learn more

  • {} 17 keys
    • false
    • {} 5 keys
      • {} 2 keys
        • false
        • false
      • null
      • "LossChoice.KL"
      • {} 5 keys
        • 250
        • {} 6 keys
          • null
          • 2
          • false
          • 42
          • false
          • null
          • 262,144
          • "pythia-160m-deduped-single-gpu"
          • false
        Summary

        Summary metrics are your model's outputs. Learn more

        • {} 72 keys
          • 0.3666253685951233
          • 0.294188529253006
          • 0.21356502175331116
          • 0.18856598436832428
          • 0.16113899648189545
          • 0.1872699409723282
          • 0.18308046460151672
          • 0.17345373332500458
          • 0.1625095158815384
          • 0.15996475517749786
          • 0.13301661610603333
          • 0.12773564457893372
          • 0.1276129186153412
          • 0.10227931290864944
          • 0.10428910702466963
          • 0.10334904491901398
          • 0.08965490013360977
          • 0.09269517660140993
          • 0.08181064575910568
          • 0.08016320317983627
          • 0.054338809102773666
          • 0.06413818150758743
          • 0.04949090629816055
          • 1.892978310585022
          • 1.9245588779449463
          • 1.7185463905334473
          • 1.6680587530136108
          • 1.5741310119628906
          • 1.4736759662628174
          • 1.406898856163025
          • 1.3218592405319214
          • 1.259551167488098
          • 1.212220549583435
          • 1.1607027053833008
          • 1.1177096366882324
          • 1.0058774948120115
          • 0.8567332625389099
          • 0.782529890537262
          • 0.7041617035865784
          • 0.6262714862823486
          • 0.5804436802864075
          • 0.4735834300518036
          • 0.3858170211315155
          • 0.31610462069511414
          • 0.24517187476158145
          • 0.19552823901176453
          • 46 ... 67
          • 1.658342719078064
          • 1.4119179248809814
          • 0.9917079210281372
          • 2.770873546600342
        Artifact Outputs

        This run produced these artifacts as outputs. Learn more

        Loading...