Skip to main content

Kastan's group workspace

gpt2_vanilla

What makes this group special?
Tags

gpt2_vanilla

Notes

only 3 gpu (and only 2 used I think)

Tags
only_3_gpu
Author
State
Finished
Start time
June 21st, 2022 6:17:57 PM
Runtime
7m 28s
Tracked hours
7m 24s
Run path
kastan/col_ai/npoonhjp
OS
Linux-4.18.0-348.23.1.el8_5.x86_64-x86_64-with-glibc2.10
Python version
3.8.12
Git repository
git clone https://github.com/hpcaitech/ColossalAI-Examples.git
Git state
git checkout -b "gpt2_vanilla" 1371a5ba8bc454d081ecb0d88f7b3f801dde01f6
Command
train_gpt.py --config gpt2_configs/gpt2_vanilla.py --from_torch
System Hardware
CPU count128
GPU count4
GPU typeNVIDIA A100-SXM4-40GB
W&B CLI Version
0.12.18
Config

Config parameters are your model's inputs. Learn more

  • {} 18 keys
    • "nccl"
    • 1
    • "gpt2_configs/gpt2_vanilla.py"
    • "gpt2_configs/gpt2_vanilla.py"
    • "/u/kastan/colossal/data/train_data_FINAL.json"
    • {} 1 key
      • "AMP_TYPE.NAIVE"
    • true
    • "titans.model.gpt.gpt.gpt2_small"
    • null
    • null
    • {} 2 keys
      • true
      • "titans.model.gpt.gpt.gpt2_small"
    • 60
    • {} 3 keys
      • 0.00015
      • "torch.optim.adam.Adam"
      • 0.01
    • {} 2 keys
      • 1
      • {} 2 keys
        • null
        • 1
    • null
    • null
    • 1,024
    • null
Summary

Summary metrics are your model's outputs. Learn more

  • {} 6 keys
    • "Goodbye, we finished training"
    • "Hi, we got started"
    • 36.07942962646485
    • 36.07942962646485
    • 2.6421
    • 0.81677
Artifact Outputs

This run produced these artifacts as outputs. Total: 1. Learn more

Loading...