DroQ + CrossQ - SBX Perf Report

Created on April 1 | Last edited on July 22
Combining CrossQ with the DroQ configuration (UTD=10, dropout_rate=0.01).
Note: the PyBullet envs use the recommended gamma=0.98.
Only 2 random seeds have been run so far; more runs are needed.
Swimmer-v4 is the only env where results are inconsistent.
DroQ + CrossQ also underperforms TQC on BipedalWalkerHardcore-v3.
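
The configuration described above can be written down as a small set of keyword arguments. The following is a minimal sketch, not the code used for these runs: the helper name `droq_crossq_kwargs` is hypothetical, and the keys `train_freq`, `gradient_steps`, and `policy_kwargs["dropout_rate"]` are assumed to follow the SB3/SBX naming convention. UTD=10 is expressed here as 10 gradient steps per environment step.

```python
from typing import Any


def droq_crossq_kwargs(pybullet: bool = False) -> dict[str, Any]:
    """Build keyword arguments for the DroQ + CrossQ setup (hypothetical helper)."""
    kwargs: dict[str, Any] = {
        # UTD=10: ten gradient updates per environment step,
        # assuming one environment step per training call.
        "train_freq": 1,
        "gradient_steps": 10,
        # Dropout rate from the DroQ configuration (key name is an assumption).
        "policy_kwargs": {"dropout_rate": 0.01},
    }
    if pybullet:
        # Discount factor recommended for the PyBullet envs in this report.
        kwargs["gamma"] = 0.98
    return kwargs


if __name__ == "__main__":
    print(droq_crossq_kwargs(pybullet=True))
```

If these names match the installed SBX version, the dictionary could be unpacked into the algorithm constructor; check the SBX documentation before relying on that.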

PyBullet Envs

HalfCheetahBulletEnv-v0


[Plot: x-axis global_step (100k to 400k); y-axis range 0 to 2500. Legend: CrossQ - SBX, DroQ + CrossQ, TQC - SB3, SAC - SB3. Run sets: SBX (8), SB3 (22).]


AntBulletEnv-v0


[Plot. Run sets: SBX (3), SB3 (21), DroQ (2).]


HopperBulletEnv-v0


[Plot. Run sets: SBX (6), SB3 (21).]


Walker2DBulletEnv-v0


[Plot. Run sets: SBX (8), SB3 (22).]


MuJoCo Envs

HalfCheetah-v4


[Plot. Run sets: SBX (6), SB3 (20).]


Ant-v4


[Plot. Run sets: SBX (6), SB3 (20).]


Hopper-v4


[Plot. Run sets: SBX (6), SB3 (20).]


Walker2d-v4


[Plot. Run sets: SBX (6), SB3 (20).]