Reports
Created by
Created On
Last edited
PQN - PR 472
First experiments on Atari
0
2024-07-18
Regression Report: ddpg_continuous_action_jax
[['?we=openrlbenchmark&wpn=cleanrl&ceik=env_id&cen=exp_name&metric=charts/episodic_return', 'ddpg_continuous_action_jax?tag=pr-371-jax', 'ddpg_continuous_action_jax?tag=rlops-pilot']]
0
2023-04-10
Regression Report: ddpg_continuous_action
[['?we=openrlbenchmark&wpn=cleanrl&ceik=env_id&cen=exp_name&metric=charts/episodic_return', 'ddpg_continuous_action?tag=pr-371', 'ddpg_continuous_action?tag=rlops-pilot']]
0
2023-04-07
0
2023-02-03
RPO on dm_control Part 2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO on dm_control Part 1
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO (alpha=0.5) on Mujoco_v2 Part 2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO on Mujoco_v2 Part 2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action_alpha_0_01?tag=pr-331']
0
2023-01-03
RPO on Mujoco_v2 Part 1
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO alpha=0.5's failure cases on Mujoco_v2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO alpha=0.5's failure cases on Mujoco_v4
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO (alpha=0.5) on Mujoco_v4 Part 2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO on Mujoco_v4 Part 2
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action_alpha_0_01?tag=pr-331']
0
2023-01-03
RPO on Gym (Gymnasium)
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
RPO on Mujoco_v4 Part 1
['ppo_continuous_action_8M?tag=v1.0.0-13-gcbd83f6', 'rpo_continuous_action?tag=pr-331']
0
2023-01-03
0
2022-10-12
Atari: CleanRL's sac_atari.py
['sac_atari?tag=pr-270&tag=v1.0.0b1-43-g6f7251f']
0
2022-11-13
CleanRL SAC (jax)
Adapted from the SBX (SB3 + Jax) implementation
0
2022-10-23
[WIP] APO on Gym Mujoco
APO performance on 3 seeds, 1M steps
0
2022-06-22
CleanRL PPG vs PPO results
Tracked runs for the CleanRL implementation of Phasic Policy Gradient and a comparison to PPO.
0
2022-05-25