Note: these runs use the recommended gamma=0.98 for the PyBullet envs.
Only 3 random seeds have been run so far, but the variance across them is quite low.
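For context on what gamma=0.98 implies, one common rule of thumb reads a discount factor as an effective planning horizon of roughly 1 / (1 - gamma) steps. The sketch below is only illustrative arithmetic. The helper name `effective_horizon` is hypothetical and not part of any library, and the 0.99 comparison value is an assumption chosen for contrast, not a setting taken from these runs.

```python
def effective_horizon(gamma: float) -> float:
    """Rule-of-thumb horizon (in steps) implied by a discount factor."""
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must be in [0, 1)")
    return 1.0 / (1.0 - gamma)


# gamma used for the PyBullet runs in this report
pybullet_horizon = round(effective_horizon(0.98), 1)

# 0.99 is an assumed comparison point, not a value from this report
comparison_horizon = round(effective_horizon(0.99), 1)

print(f"gamma=0.98 -> ~{pybullet_horizon} steps")
print(f"gamma=0.99 -> ~{comparison_horizon} steps")
```

Under this reading, gamma=0.98 weights rewards over roughly 50 steps, about half the horizon of gamma=0.99.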
PyBullet Envs
HalfCheetahBulletEnv-v0
[Line chart: TQC w/ SimBa on HalfCheetahBulletEnv-v0, eval/mean_reward]
AntBulletEnv-v0
HopperBulletEnv-v0
Walker2DBulletEnv-v0
MuJoCo Envs
HalfCheetah-v4
Ant-v4
Hopper-v4
Walker2d-v4
Swimmer-v4