Skip to main content

APEX Mixed IL and RL

We train APEX with DQfD sytle Imitation loss for Imitation Learning Samples. The ratio of training IL and RL is kept constant at 25%
Created on August 8|Last edited on August 8

Section 1

Add markdown, images, and LaTeXLaTeX




Select runs that logged custom_metrics/episode_score_normalized_mean
to visualize data in this line chart.
Select runs that logged evaluation/custom_metrics/episode_score_normalized_max
to visualize data in this line chart.
MARWIL
0



Run set
3