Slimshadys's group workspace

2023-12-08 22:35:23 Total Iteration Time: 9,00002
2023-12-08 22:35:23 Cumulative Model Updates: 1.285
2023-12-08 22:35:23 Cumulative Timesteps: 32.163.562
2023-12-08 22:35:23 Timesteps Collected: 50.018
2023-12-08 22:35:23 --------END ITERATION REPORT--------
2023-12-08 22:35:33 --------BEGIN ITERATION REPORT--------
2023-12-08 22:35:33 Policy Reward: 14,74833
2023-12-08 22:35:33 Policy Entropy: 0,25240
2023-12-08 22:35:33 Value Function Loss: 0,42019
2023-12-08 22:35:33 Mean KL Divergence: 0,00350
2023-12-08 22:35:33 SB3 Clip Fraction: 0,03446
2023-12-08 22:35:33 Policy Update Magnitude: 0,12955
2023-12-08 22:35:33 Value Function Update Magnitude: 0,18158
2023-12-08 22:35:33 Collected Steps per Second: 7.463,44111
2023-12-08 22:35:33 Overall Steps per Second: 5.971,93834
2023-12-08 22:35:33 Timestep Collection Time: 6,70066
2023-12-08 22:35:33 Timestep Consumption Time: 1,67350
2023-12-08 22:35:33 PPO Batch Consumption Time: 0,16122
2023-12-08 22:35:33 Total Iteration Time: 8,37417
2023-12-08 22:35:33 Cumulative Model Updates: 1.287
2023-12-08 22:35:33 Cumulative Timesteps: 32.213.572
2023-12-08 22:35:33 Timesteps Collected: 50.010
2023-12-08 22:35:33 --------END ITERATION REPORT--------
2023-12-08 22:35:33 Saving checkpoint 32213572...
2023-12-08 22:35:33 Checkpoint 32213572 saved!
2023-12-08 22:35:41 --------BEGIN ITERATION REPORT--------
2023-12-08 22:35:41 Policy Reward: 15,10587
2023-12-08 22:35:41 Policy Entropy: 0,25114
2023-12-08 22:35:41 Value Function Loss: 0,42456
2023-12-08 22:35:41 Mean KL Divergence: 0,00389
2023-12-08 22:35:41 SB3 Clip Fraction: 0,04017
2023-12-08 22:35:41 Policy Update Magnitude: 0,13155
2023-12-08 22:35:41 Value Function Update Magnitude: 0,15167
2023-12-08 22:35:41 Collected Steps per Second: 6.755,18820
2023-12-08 22:35:41 Overall Steps per Second: 5.376,61346
2023-12-08 22:35:41 Timestep Collection Time: 7,40438
2023-12-08 22:35:41 Timestep Consumption Time: 1,89850
2023-12-08 22:35:41 PPO Batch Consumption Time: 0,15978
2023-12-08 22:35:41 Total Iteration Time: 9,30288
2023-12-08 22:35:41 Cumulative Model Updates: 1.289
2023-12-08 22:35:41 Cumulative Timesteps: 32.263.590
2023-12-08 22:35:41 Timesteps Collected: 50.018
2023-12-08 22:35:41 --------END ITERATION REPORT--------
2023-12-08 22:35:49 --------BEGIN ITERATION REPORT--------
2023-12-08 22:35:49 Policy Reward: 13,77584
2023-12-08 22:35:49 Policy Entropy: 0,23988
2023-12-08 22:35:49 Value Function Loss: 0,42188
2023-12-08 22:35:49 Mean KL Divergence: 0,00608
2023-12-08 22:35:49 SB3 Clip Fraction: 0,04770
2023-12-08 22:35:49 Policy Update Magnitude: 0,12393
2023-12-08 22:35:49 Value Function Update Magnitude: 0,15713
2023-12-08 22:35:49 Collected Steps per Second: 7.234,96852
2023-12-08 22:35:49 Overall Steps per Second: 5.842,69502
2023-12-08 22:35:49 Timestep Collection Time: 6,91558
2023-12-08 22:35:49 Timestep Consumption Time: 1,64793
2023-12-08 22:35:49 PPO Batch Consumption Time: 0,15378
2023-12-08 22:35:49 Total Iteration Time: 8,56351
2023-12-08 22:35:49 Cumulative Model Updates: 1.291
2023-12-08 22:35:49 Cumulative Timesteps: 32.313.624
2023-12-08 22:35:49 Timesteps Collected: 50.034
2023-12-08 22:35:49 --------END ITERATION REPORT--------
2023-12-08 22:35:49 Saving checkpoint 32313624...
2023-12-08 22:35:49 Checkpoint 32313624 saved!
2023-12-08 22:35:59 --------BEGIN ITERATION REPORT--------
2023-12-08 22:35:59 Policy Reward: 15,41218
2023-12-08 22:35:59 Policy Entropy: 0,23969
2023-12-08 22:35:59 Value Function Loss: 0,43899
2023-12-08 22:35:59 Mean KL Divergence: 0,00454
2023-12-08 22:35:59 SB3 Clip Fraction: 0,04252
2023-12-08 22:35:59 Policy Update Magnitude: 0,11529
2023-12-08 22:35:59 Value Function Update Magnitude: 0,16581
2023-12-08 22:35:59 Collected Steps per Second: 6.959,04673
2023-12-08 22:35:59 Overall Steps per Second: 5.491,30778
2023-12-08 22:35:59 Timestep Collection Time: 7,18633
2023-12-08 22:35:59 Timestep Consumption Time: 1,92079
2023-12-08 22:35:59 PPO Batch Consumption Time: 0,15778
2023-12-08 22:35:59 Total Iteration Time: 9,10712
2023-12-08 22:35:59 Cumulative Model Updates: 1.293
2023-12-08 22:35:59 Cumulative Timesteps: 32.363.634
2023-12-08 22:35:59 Timesteps Collected: 50.010
2023-12-08 22:35:59 --------END ITERATION REPORT--------
2023-12-08 22:36:09 --------BEGIN ITERATION REPORT--------
2023-12-08 22:36:09 Policy Reward: 15,40718
2023-12-08 22:36:09 Policy Entropy: 0,23502
2023-12-08 22:36:09 Value Function Loss: 0,43022
2023-12-08 22:36:09 Mean KL Divergence: 0,00263
2023-12-08 22:36:09 SB3 Clip Fraction: 0,02825
2023-12-08 22:36:09 Policy Update Magnitude: 0,11576
2023-12-08 22:36:09 Value Function Update Magnitude: 0,15486
2023-12-08 22:36:09 Collected Steps per Second: 6.951,34862
2023-12-08 22:36:09 Overall Steps per Second: 5.524,17814
2023-12-08 22:36:09 Timestep Collection Time: 7,19774
2023-12-08 22:36:09 Timestep Consumption Time: 1,85953
2023-12-08 22:36:09 PPO Batch Consumption Time: 0,15828
2023-12-08 22:36:09 Total Iteration Time: 9,05727
2023-12-08 22:36:09 Cumulative Model Updates: 1.295
2023-12-08 22:36:09 Cumulative Timesteps: 32.413.668
2023-12-08 22:36:09 Timesteps Collected: 50.034
2023-12-08 22:36:09 --------END ITERATION REPORT--------
2023-12-08 22:36:09 Saving checkpoint 32413668...
2023-12-08 22:36:09 Checkpoint 32413668 saved!
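
The numbers in this log appear to be printed in a locale that uses a comma as the decimal separator and a dot as the thousands separator (for example `7.463,44111`), so they cannot be passed to `float()` directly. The following is a minimal sketch for reading the iteration reports back into Python, assuming that locale interpretation is correct. The helper names `parse_number` and `parse_reports` are hypothetical and are not part of the training tool that produced the log. The sample values are copied from the 22:35:33 report above. The throughput check at the end assumes the two rate metrics are simply `Timesteps Collected` divided by the matching time metric, which is consistent with the logged values but is not stated by the log itself.

```python
import re

# Matches "Metric Name: 1.234,56" at the end of a line. Using search() means an
# optional leading timestamp (digits, dashes, colons) is skipped automatically.
METRIC_RE = re.compile(r"([A-Za-z][A-Za-z0-9 ]*): ([0-9.,]+)\s*$")


def parse_number(text: str) -> float:
    """Convert '7.463,44111' to 7463.44111.

    Assumption: '.' is a thousands separator and ',' is the decimal separator.
    """
    return float(text.replace(".", "").replace(",", "."))


def parse_reports(log_text: str) -> list[dict[str, float]]:
    """Return one dict of metrics per complete BEGIN/END iteration report."""
    reports = []
    current = None  # stays None until a BEGIN marker is seen
    for line in log_text.splitlines():
        if "BEGIN ITERATION REPORT" in line:
            current = {}
        elif "END ITERATION REPORT" in line:
            if current is not None:
                reports.append(current)
            current = None
        elif current is not None:
            match = METRIC_RE.search(line)
            if match:
                current[match.group(1)] = parse_number(match.group(2))
    return reports


# Excerpt of the 22:35:33 report from the log above.
SAMPLE = """\
--------BEGIN ITERATION REPORT--------
Collected Steps per Second: 7.463,44111
Overall Steps per Second: 5.971,93834
Timestep Collection Time: 6,70066
Total Iteration Time: 8,37417
Timesteps Collected: 50.010
--------END ITERATION REPORT--------
"""

report = parse_reports(SAMPLE)[0]

# Assumed relationship: rate = timesteps collected / elapsed time.
collected_rate = report["Timesteps Collected"] / report["Timestep Collection Time"]
overall_rate = report["Timesteps Collected"] / report["Total Iteration Time"]

print("recomputed collected steps/s:", round(collected_rate, 2))
print("recomputed overall steps/s:  ", round(overall_rate, 2))
```

The recomputed rates should agree with the logged `Collected Steps per Second` and `Overall Steps per Second` up to the rounding of the logged times. Lines outside a BEGIN/END pair, such as the truncated report at the top of the log and the checkpoint messages, are ignored by this sketch.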