Skip to main content

Kastan's group workspace

Timestamps visible
2022-10-04 21:16:21
ValueError: throughput not available not enough values to unpack (expected 2, got 1)
2022-10-04 21:16:23
ValueError: throughput not available not enough values to unpack (expected 2, got 1)
2022-10-04 21:16:23
[10/04/22 16:16:07] INFO     colossalai - colossalai - INFO:
2022-10-04 21:16:23
                             /scratch/bbki/kastanday/conda_envs/envs/nice_base/e
2022-10-04 21:16:23
                             nvs/col_ai_quant/lib/python3.9/site-packages/coloss
2022-10-04 21:16:23
                             alai/trainer/hooks/_log_hook.py:99
2022-10-04 21:16:23
                             after_train_epoch
2022-10-04 21:16:23
                    INFO     colossalai - colossalai - INFO: [Epoch 1 / Train]:
2022-10-04 21:16:23
                             Loss = 40.485 | LR = 7.5e-05 | Throughput = 94.042
2022-10-04 21:16:23
                    INFO     colossalai - colossalai - INFO:
2022-10-04 21:16:23
                             /scratch/bbki/kastanday/conda_envs/envs/nice_base/e
2022-10-04 21:16:23
                             nvs/col_ai_quant/lib/python3.9/site-packages/coloss
2022-10-04 21:16:23
                             alai/utils/memory.py:91 report_memory_usage
2022-10-04 21:16:23
                    INFO     colossalai - colossalai - INFO: [Epoch 1 / Train]:
2022-10-04 21:16:23
                             GPU: allocated 969.31 MB, max allocated 1858.39 MB,
2022-10-04 21:16:23
                             cached: 1984.0 MB, max cached: 1984.0 MB
2022-10-04 21:16:23
                    INFO     colossalai - colossalai - INFO:
2022-10-04 21:16:23
                             /u/kastanday/LLM-Distributed-Quantization/wandb_log
2022-10-04 21:16:23
                             s/custom_wandb_log_hook.py:125 after_train
2022-10-04 21:16:23
                    INFO     colossalai - colossalai - INFO: training finished