Skip to main content

SBX TD3 - PR-21 perf check

Created on January 12|Last edited on January 12

HalfCheetah-v4


200k400k600k800k1Mglobal_step1000200030004000
algo: td3 PR-21
algo: td3, saved_hyperparams.learning_rate: 0.001 TD3 RL Zoo 2.3.0
algo: td3 TD3 RL Zoo 2.2.1
200k400k600k800k1Mglobal_step200040006000800010000
algo: td3, saved_hyperparams.learning_rate: 0.001 TD3 RL Zoo 2.3.0
algo: td3 PR-21
algo: td3 TD3 RL Zoo 2.2.1
00.050.10.150.20.250.3Time (minutes)-2-1012
algo: td3 PR-21
algo: td3, saved_hyperparams.learning_rate: 0.001 TD3 RL Zoo 2.3.0
algo: td3 TD3 RL Zoo 2.2.1
TD3 RL Zoo 2.3.0
4
TD3 RL Zoo 2.2.1
11
PR-21
3



Ant-v4


TD3 RL Zoo 2.3.0
6
TD3 RL Zoo 2.2.1
13
pr-21
3



Hopper-v4


TD3 RL Zoo 2.3.0
6
TD3 RL Zoo 2.2.1
10
pr-21
3



Walker2d-v4


TD3 RL Zoo 2.3.0
6
TD3 RL Zoo 2.2.1
10
pr-21
3



Swimmer-v4


TD3 RL Zoo 2.3.0
6
TD3 RL Zoo 2.2.1
10
pr-21
3