Skip to main content

Annavettoruzzo's group workspace

Timestamps visible
2025-05-13 23:04:16
------------------------------------------------------------------------------------------------
2025-05-13 23:04:16
 validation loss at iteration 2424 | lm loss value: 3.474080E+00 | lm loss PPL: 3.226813E+01 |
2025-05-13 23:04:16
------------------------------------------------------------------------------------------------
2025-05-13 23:04:16
/home/haok/MoE-Research/Megatron-LM/megatron/core/transformer/transformer_layer.py:339: UserWarning: TransformerLayer._get_layer_offset is deprecated.Please use get_transformer_layer_offset instead.
2025-05-13 23:04:16
  warnings.warn(
2025-05-13 23:04:37
(min, max) time across ranks (ms):
2025-05-13 23:04:37
    save-checkpoint ................................: (20436.79, 20436.95)
2025-05-13 23:05:45
(min, max) time across ranks (ms):
2025-05-13 23:05:45
    evaluate .......................................: (68429.41, 68429.91)
2025-05-13 23:05:45
------------------------------------------------------------------------------------------------------------------
2025-05-13 23:05:45
 validation loss at iteration 2424 on validation set | lm loss value: 3.473966E+00 | lm loss PPL: 3.226445E+01 |
2025-05-13 23:05:45
------------------------------------------------------------------------------------------------------------------
2025-05-13 23:07:03
(min, max) time across ranks (ms):
2025-05-13 23:07:03
    evaluate .......................................: (78539.36, 78539.79)
2025-05-13 23:07:04
------------------------------------------------------------------------------------------------------------
2025-05-13 23:07:04
 validation loss at iteration 2424 on test set | lm loss value: 3.473430E+00 | lm loss PPL: 3.224717E+01 |
2025-05-13 23:07:04
------------------------------------------------------------------------------------------------------------