Annavettoruzzo's group workspace
2025-05-14 01:43:53
------------------------------------------------------------------------------------------------
2025-05-14 01:43:53
 validation loss at iteration 4165 | lm loss value: 3.280041E+00 | lm loss PPL: 2.657687E+01 |
2025-05-14 01:43:53
------------------------------------------------------------------------------------------------
2025-05-14 01:43:53
/home/haok/MoE-Research/Megatron-LM/megatron/core/transformer/transformer_layer.py:339: UserWarning: TransformerLayer._get_layer_offset is deprecated.Please use get_transformer_layer_offset instead.
2025-05-14 01:43:53
  warnings.warn(
2025-05-14 01:44:13
(min, max) time across ranks (ms):
2025-05-14 01:44:13
    save-checkpoint ................................: (20427.66, 20427.84)
2025-05-14 01:46:01
(min, max) time across ranks (ms):
2025-05-14 01:46:01
    evaluate .......................................: (107643.64, 107644.57)
2025-05-14 01:46:01
------------------------------------------------------------------------------------------------------------------
2025-05-14 01:46:01
 validation loss at iteration 4165 on validation set | lm loss value: 3.279893E+00 | lm loss PPL: 2.657292E+01 |
2025-05-14 01:46:01
------------------------------------------------------------------------------------------------------------------
2025-05-14 01:48:00
(min, max) time across ranks (ms):
2025-05-14 01:48:00
    evaluate .......................................: (118829.98, 118830.70)
2025-05-14 01:48:00
------------------------------------------------------------------------------------------------------------
2025-05-14 01:48:00
 validation loss at iteration 4165 on test set | lm loss value: 3.279224E+00 | lm loss PPL: 2.655516E+01 |
2025-05-14 01:48:00
------------------------------------------------------------------------------------------------------------
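
Note on reading the validation lines above: each reported "lm loss PPL" appears to be the exponential of the "lm loss value" on the same line (for example, exp(3.280041) is approximately 26.577). The sketch below parses such lines and checks that relationship. The regular expression is written against the line format seen in this log only, and the helper names `parse_validation_line` and `ppl_matches_loss` are hypothetical, not part of Megatron-LM.

```python
import math
import re

# Pattern for validation lines as they appear in this log (an assumption:
# other Megatron-LM versions or configs may format these lines differently).
LINE_RE = re.compile(
    r"validation loss at iteration (\d+)(?: on (.+?))? \| "
    r"lm loss value: ([0-9.E+-]+) \| lm loss PPL: ([0-9.E+-]+) \|"
)


def parse_validation_line(line):
    """Return iteration, split, loss and PPL from one log line, or None."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    iteration, split, loss, ppl = match.groups()
    return {
        "iteration": int(iteration),
        # The first validation line in the log names no split.
        "split": split if split is not None else "unspecified",
        "loss": float(loss),
        "ppl": float(ppl),
    }


def ppl_matches_loss(record, rel_tol=1e-5):
    """Check that the logged perplexity equals exp(loss) up to rounding."""
    return math.isclose(math.exp(record["loss"]), record["ppl"], rel_tol=rel_tol)


if __name__ == "__main__":
    sample = (
        " validation loss at iteration 4165 on test set | "
        "lm loss value: 3.279224E+00 | lm loss PPL: 2.655516E+01 |"
    )
    record = parse_validation_line(sample)
    print(record["split"], ppl_matches_loss(record))
```

The tolerance is loose on purpose, because both numbers in the log are rounded to seven significant digits.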