v0.5 | unirome_collab Logs – Weights & Biases

Slimshadys's group workspace

Group: v0.5

2023-12-10 13:51:59 --------BEGIN ITERATION REPORT--------
2023-12-10 13:51:59 Policy Reward: 44,57078
2023-12-10 13:51:59 Policy Entropy: 3,26351
2023-12-10 13:51:59 Value Function Loss: 0,26440
2023-12-10 13:51:59 Mean KL Divergence: 0,00928
2023-12-10 13:51:59 SB3 Clip Fraction: 0,10111
2023-12-10 13:51:59 Policy Update Magnitude: 0,40770
2023-12-10 13:51:59 Value Function Update Magnitude: 0,24873
2023-12-10 13:51:59 Collected Steps per Second: 6.783,87269
2023-12-10 13:51:59 Overall Steps per Second: 5.249,87702
2023-12-10 13:51:59 Timestep Collection Time: 7,37514
2023-12-10 13:51:59 Timestep Consumption Time: 2,15499
2023-12-10 13:51:59 PPO Batch Consumption Time: 0,15576
2023-12-10 13:51:59 Total Iteration Time: 9,53013
2023-12-10 13:51:59 Cumulative Model Updates: 1.350
2023-12-10 13:51:59 Cumulative Timesteps: 22.560.830
2023-12-10 13:51:59 Timesteps Collected: 50.032
2023-12-10 13:51:59 --------END ITERATION REPORT--------
2023-12-10 13:51:59 Saving checkpoint 22560830...
2023-12-10 13:51:59 Checkpoint 22560830 saved!
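
The figures in the report are printed with "," as the decimal mark and "." as the thousands separator, so they need converting before any arithmetic. The following is a minimal Python sketch, not part of the logged run, that parses a few metric lines copied from the report and checks that the two throughput figures agree with the step count and timings. The helper names `parse_number` and `parse_report` are hypothetical, and the number format is an assumption read off the values shown (it would misread a value such as "44.570" if that were meant as a decimal).

```python
import re

# Metric lines copied from the iteration report above (timestamps omitted).
REPORT = """\
Collected Steps per Second: 6.783,87269
Overall Steps per Second: 5.249,87702
Timestep Collection Time: 7,37514
Timestep Consumption Time: 2,15499
Total Iteration Time: 9,53013
Timesteps Collected: 50.032
"""

# Optional leading timestamp, then "Name: value". The name must start with a
# letter so that a bare timestamp line is never mistaken for a metric.
LINE_RE = re.compile(
    r"(?:\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\s+)?"
    r"([A-Za-z][^:]*):\s*([\d.,]+)\s*"
)


def parse_number(text: str) -> float:
    """Convert '6.783,87269' to 6783.87269 (assumed format: '.' groups, ',' decimal)."""
    return float(text.replace(".", "").replace(",", "."))


def parse_report(report: str) -> dict:
    """Return {metric name: value} for every 'Name: value' line; skip the rest."""
    metrics = {}
    for line in report.splitlines():
        match = LINE_RE.fullmatch(line.strip())
        if match:
            metrics[match.group(1)] = parse_number(match.group(2))
    return metrics


metrics = parse_report(REPORT)

# Throughput recomputed from the step count and the two timings.
collected_rate = metrics["Timesteps Collected"] / metrics["Timestep Collection Time"]
overall_rate = metrics["Timesteps Collected"] / metrics["Total Iteration Time"]

# Collection time plus consumption time should add up to the iteration time.
time_sum = metrics["Timestep Collection Time"] + metrics["Timestep Consumption Time"]

print(f"collected steps/s: {collected_rate:.2f}")
print(f"overall steps/s:   {overall_rate:.2f}")
print(f"collection + consumption time: {time_sum:.5f}")
```

The recomputed rates should match the logged "Collected Steps per Second" and "Overall Steps per Second" up to the rounding of the printed timings. The parser also accepts lines that carry a leading timestamp and ignores lines without a "Name: value" shape, such as the checkpoint messages.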