Igoro's group workspace

20B_eval

Author
State
Crashed
Start time
November 24th, 2021 2:30:18 AM
Runtime
20h 39m 29s
Tracked hours
-
Run path
eleutherai/gpt-thicc/1pg2z8jn
OS
Linux-5.11.0-40-generic-x86_64-with-glibc2.10
Python version
3.8.5
Git repository
git clone https://github.com/EleutherAI/pyfra_scripts.git
Git state
git checkout -b "20B_eval" 36bdc8af77fc204c06e375cc834929a590bed8f3
Command
train_model.py
System Hardware
CPU count: 12
GPU count: 1
GPU type: GeForce RTX 3080
W&B CLI Version
0.12.1
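
The run path listed above has three slash-separated parts, which in W&B correspond to entity, project, and run ID. As a minimal sketch, the snippet below splits such a path into its components; `RunPath` and `parse_run_path` are hypothetical helper names introduced here for illustration and are not part of the W&B client.

```python
from typing import NamedTuple


class RunPath(NamedTuple):
    """Hypothetical container for the three parts of a W&B run path."""
    entity: str
    project: str
    run_id: str


def parse_run_path(path: str) -> RunPath:
    """Split an 'entity/project/run_id' string into its components."""
    parts = path.strip("/").split("/")
    # A run path must have exactly three non-empty segments.
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected 'entity/project/run_id', got {path!r}")
    return RunPath(*parts)


run = parse_run_path("eleutherai/gpt-thicc/1pg2z8jn")
print(run.entity, run.project, run.run_id)
```

The unsplit string is the same form that the W&B public API accepts when looking up a run, for example via `wandb.Api().run(path)`, assuming the `wandb` package is installed and the caller has access to the project.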
Config

Config parameters are your model's inputs.

  • {} 53 keys
    • 0
    • true
    • true
    • 1
    • "mmap"
    • "/mnt/ssd-1/data/pile_20B_tokenizer/pile_20B_tokenizer_text_document"
    • "nccl"
    • 1,000
    • 10
    • {} 7 keys
      • true
      • 32
      • 1
      • 0
      • 6,144
      • "small_init"
      • "/mnt/ssd-1/20B_checkpoints"
      • "/mnt/ssd-1/logs"
      • 2
      • 150,000
      • "cosine"
      • 2,048
      • 0.0000097
      • 2
      • true
      • "layernorm"
      • 64
      • 44
      • {} 2 keys
        • {} 3 keys
          • [] 2 items
            • 0.9
            • 0.95
          • 0.00000001
          • 0.000097
        • "Adam"
      • "wang_init"
      • "column"
      • false
      • 4
      • "rotary"
      • 0.25
      • "/mnt/ssd-1/20B_checkpoints"
      • 1,000
      • true
      • 2,048
      • "995,4,1"
      • 2
      • true
      • "/mnt/ssd-1/tensorboard"
      • "HFTokenizer"
      • 4
      • 150,000
      • 46 ... 48
      • "eleutherai"
      • 0.01
      • 0.01
      • {} 8 keys

Summary

Summary metrics are your model's outputs.

No summary metrics saved for this run.