Skip to main content

[Offline] BC 10%

Created on September 28|Last edited on June 7
Results are averaged over 4 seeds. For each dataset we plot d4rl normalized score.
Locomotion reference scores are from Offline Reinforcement Learning with Implicit Q-Learning

Locomotion

Maze2d

AntMaze

Adroit