Skip to main content

Quad-Swarm-RL

Created on July 21|Last edited on July 22
Below are the reward plots with both input and return normalization

200M400M600M800Mglobal_step-100-80-60-40-200
200M400M600M800Mglobal_step-6-4-2
Run set
4

As a comparison, here are some plots without normalization.

Run set
8