Skip to main content

Regression Report: ppo_continuous_action

['ddpg_continuous_action_jax?user=joaogui1&tag=rlops-pilot', 'ddpg_continuous_action_jax?user=joaogui1&tag=pr-298', 'ddpg_continuous_action_jax?user=costa-huang&tag=rlops-pilot', 'ddpg_continuous_action_jax?user=costa-huang&tag=pr-298', 'ddpg_continuous_action?user=costa-huang&tag=pr-299', 'ppo_continuous_action?user=costa-huang&tag=rlops-pilot']
Created on November 8|Last edited on November 8

200k400k600k800kSteps100020003000Episodic Return
1020304050Time (minutes)50010001500200025003000Episodic Return
CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action ({'user': ['costa-huang'], 'tag': ['pr-299']})
3
CleanRL's ppo_continuous_action ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
10



CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action ({'user': ['costa-huang'], 'tag': ['pr-299']})
3
CleanRL's ppo_continuous_action ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
10



CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['joaogui1'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
3
CleanRL's ddpg_continuous_action_jax ({'user': ['costa-huang'], 'tag': ['pr-298']})
3
CleanRL's ddpg_continuous_action ({'user': ['costa-huang'], 'tag': ['pr-299']})
3
CleanRL's ppo_continuous_action ({'user': ['costa-huang'], 'tag': ['rlops-pilot']})
10