Skip to main content
Reports
Created by
Created On
Last edited
Proximal Policy Optimization (PPO) Experiments
Here we can observe three experiments with PPO RL algorithm (with different number of timesteps {2000,5000,10000}) seperately (with non-smoothing form) and finally all three together (with smoothing form).
2
2021-02-28
Deep Q-Network (DQN) Experiments
Here we can observe three experiments with DQN RL algorithm (with different number of timesteps {2000,5000,10000}) seperately (with non-smoothing form) and finally all three together (with smoothing form).
1
2021-02-28
0
2021-09-17
Αctor-Critic (A2C) Experiments
Here we can observe three experiments with A2C RL algorithm (with different number of n_steps {16,36}) seperately (with non-smoothing form) and finally both of them (with smoothing form).
1
2021-02-28