Skip to main content

Levmckinney's group workspace

Timestamps visible
2023-05-05 04:54:07
Using 'GPTNeoXLayer' for transformer_auto_wrap_policy.
2023-05-05 04:54:25
Gradient accumulation steps: 64
2023-05-05 04:54:25
Using 262_144 tokens per training step.
2023-05-05 04:54:25
All processes have completed setup. Starting training.
2023-05-05 10:53:22
Training: 100%|██████████| 16000/16000 [5:58:56<00:00,  1.35s/it]
2023-05-05 10:53:23
Saving lens to /output/EleutherAI/pythia-6.9b-deduped-1683261951