Amr-amr's group workspace

2024-10-29 21:24:00
    throughput/device/batches_per_second=11.07
2024-10-29 21:24:00
    System/Peak GPU Memory (MB)=67,040
2024-10-29 21:24:00
2024-10-29 17:24:00.209 cn-n001.server.mila.quebec:0 olmo.train:967 INFO [step=235711/262144,epoch=0]
2024-10-29 21:24:00
    optim/total_grad_norm=1.062
2024-10-29 21:24:00
    train/CrossEntropyLoss=4.340
2024-10-29 21:24:00
    train/Perplexity=76.69
2024-10-29 21:24:00
    throughput/total_tokens=123,580,448,768
2024-10-29 21:24:00
    throughput/total_training_Gflops=1,206,746,275
2024-10-29 21:24:00
    throughput/total_training_log_Gflops=20.91
2024-10-29 21:24:00
    throughput/device/tokens_per_second=1,446,019
2024-10-29 21:24:00
    throughput/device/batches_per_second=11.03
2024-10-29 21:24:00
2024-10-29 17:24:00.301 cn-n001.server.mila.quebec:0 olmo.train:967 INFO [step=235712/262144,epoch=0]
2024-10-29 21:24:00
    optim/total_grad_norm=1.013
2024-10-29 21:24:00
    train/CrossEntropyLoss=4.366
2024-10-29 21:24:00
    train/Perplexity=78.72
2024-10-29 21:24:00
    throughput/total_tokens=123,580,973,056
2024-10-29 21:24:00
    throughput/total_training_Gflops=1,206,751,394
2024-10-29 21:24:00
    throughput/total_training_log_Gflops=20.91
2024-10-29 21:24:00
    throughput/device/tokens_per_second=1,446,166
2024-10-29 21:24:00
    throughput/device/batches_per_second=11.03
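
The two complete steps logged above can be cross-checked against each other. The sketch below is not part of the training code. It assumes, from the numbers alone, that `train/Perplexity` is the exponential of `train/CrossEntropyLoss`, that `throughput/total_training_log_Gflops` is the natural log of `throughput/total_training_Gflops`, and that every step consumes the same number of tokens. The `records` list and the `derive` helper are hypothetical names introduced for illustration.

```python
import math

# Values copied by hand from the two complete log entries above.
records = [
    {"step": 235711, "loss": 4.340, "ppl": 76.69,
     "tokens": 123_580_448_768, "gflops": 1_206_746_275, "log_gflops": 20.91},
    {"step": 235712, "loss": 4.366, "ppl": 78.72,
     "tokens": 123_580_973_056, "gflops": 1_206_751_394, "log_gflops": 20.91},
]


def derive(rec):
    """Recompute derived metrics from the raw ones (assumed relationships)."""
    tokens_per_step, remainder = divmod(rec["tokens"], rec["step"])
    return {
        "ppl_from_loss": math.exp(rec["loss"]),      # assumed: ppl = exp(loss)
        "log_gflops": math.log(rec["gflops"]),       # assumed: natural log
        "tokens_per_step": tokens_per_step,
        "remainder": remainder,                      # 0 if tokens divide evenly
    }


for rec in records:
    d = derive(rec)
    print(
        f"step={rec['step']} "
        f"ppl_from_loss={d['ppl_from_loss']:.2f} (logged {rec['ppl']}) "
        f"log_gflops={d['log_gflops']:.2f} (logged {rec['log_gflops']}) "
        f"tokens_per_step={d['tokens_per_step']} remainder={d['remainder']}"
    )
```

Small differences between the recomputed and logged perplexity are expected, since the loss is printed to only three decimal places.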