Skip to main content

SBX SAC - influence of learning rate

lr=3e-4 vs lr=1e-3
Created on January 11|Last edited on January 11

HalfCheetah-v4


200k400k600k800k1Mglobal_step100020003000
algo: sac, saved_hyperparams.learning_rate: 0.001 TD3 RL Zoo 2.3.0
algo: sac TD3 RL Zoo 2.2.1
200k400k600k800k1Mglobal_step020004000600080001000012000
algo: sac, saved_hyperparams.learning_rate: 0.001 TD3 RL Zoo 2.3.0
algo: sac TD3 RL Zoo 2.2.1
TD3 RL Zoo 2.3.0
3
TD3 RL Zoo 2.2.1
10



Ant-v4


TD3 RL Zoo 2.3.0
3
TD3 RL Zoo 2.2.1
10



Hopper-v4


TD3 RL Zoo 2.3.0
3
TD3 RL Zoo 2.2.1
10



Walker2d-v4


TD3 RL Zoo 2.3.0
3
TD3 RL Zoo 2.2.1
10



Swimmer-v4


TD3 RL Zoo 2.3.0
3
TD3 RL Zoo 2.2.1
10