Amr-amr's group workspace

2025-04-03 07:24:44
    throughput/device/batches_per_second=0.7797
2025-04-03 07:24:45
2025-04-03 03:24:45.933 cn-l049.server.mila.quebec:0 olmo.train:967 INFO [step=149969/262144,epoch=0]
2025-04-03 07:24:45
    optim/total_grad_norm=0.2515
2025-04-03 07:24:45
    train/CrossEntropyLoss=2.835
2025-04-03 07:24:45
    train/Perplexity=17.02
2025-04-03 07:24:45
    throughput/total_tokens=78,626,947,072
2025-04-03 07:24:45
    throughput/total_training_Gflops=44,567,394,073
2025-04-03 07:24:45
    throughput/total_training_log_Gflops=24.52
2025-04-03 07:24:45
    throughput/device/tokens_per_second=25,549
2025-04-03 07:24:45
    throughput/device/batches_per_second=0.7797
2025-04-03 07:24:47
2025-04-03 03:24:47.216 cn-l049.server.mila.quebec:0 olmo.train:967 INFO [step=149970/262144,epoch=0]
2025-04-03 07:24:47
    optim/total_grad_norm=0.2591
2025-04-03 07:24:47
    train/CrossEntropyLoss=2.776
2025-04-03 07:24:47
    train/Perplexity=16.05
2025-04-03 07:24:47
    throughput/total_tokens=78,627,471,360
2025-04-03 07:24:47
    throughput/total_training_Gflops=44,567,691,250
2025-04-03 07:24:47
    throughput/total_training_log_Gflops=24.52
2025-04-03 07:24:47
    throughput/device/tokens_per_second=25,549
2025-04-03 07:24:47
    throughput/device/batches_per_second=0.7797
2025-04-03 07:24:47
    System/Peak GPU Memory (MB)=30,452
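
The logged metrics above appear to be related by simple formulas, and a short script can sanity-check them. This is a minimal sketch under stated assumptions, not the trainer's actual code. It assumes `train/Perplexity` is `exp(train/CrossEntropyLoss)`, that `throughput/total_training_log_Gflops` is the natural log of `throughput/total_training_Gflops`, and that the difference in `throughput/total_tokens` between consecutive steps is the global tokens per step. The device-count estimate additionally assumes `batches_per_second` counts global steps while `tokens_per_second` is per device. All variable and function names are hypothetical; the numbers are copied from steps 149969 and 149970.

```python
import math

# Values copied from the two logged steps (149969, then 149970).
ce_loss = [2.835, 2.776]
perplexity = [17.02, 16.05]
total_tokens = [78_626_947_072, 78_627_471_360]
total_gflops = [44_567_394_073, 44_567_691_250]
log_gflops = [24.52, 24.52]
tokens_per_second_device = 25_549
batches_per_second = 0.7797


def perplexity_from_loss(loss: float) -> float:
    """Assumed relation: perplexity = exp(cross-entropy loss in nats)."""
    return math.exp(loss)


# Perplexity check. The tolerance is loose because the logged loss is rounded.
for loss, ppl in zip(ce_loss, perplexity):
    assert abs(perplexity_from_loss(loss) - ppl) < 0.02

# Assumed relation: the "log" metric is the natural log of cumulative GFLOPs.
for gflops, logged in zip(total_gflops, log_gflops):
    assert round(math.log(gflops), 2) == logged

# Tokens consumed by one optimizer step (global, across all devices).
tokens_per_step = total_tokens[1] - total_tokens[0]

# Rough device-count estimate under the assumptions stated above.
estimated_devices = tokens_per_step * batches_per_second / tokens_per_second_device

print(f"tokens per step: {tokens_per_step}")  # 524288, i.e. 2**19
print(f"estimated device count: {estimated_devices:.2f}")
```

If the assumed relations hold, the assertions pass silently. A failure would point to a different definition (for example a base-2 or base-10 logarithm) and not necessarily to a problem with the run.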