Alpha3 tune (Decaying Lambda lr as actor lr)

Overview of the alpha3 hyperparameter tuning.

Created on February 28|Last edited on February 28


This report contains the results of the alpha3 hyperparameter tuning, where we used a linearly decaying λ learning rate which was set equal to the actor learning rate.
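A minimal sketch of such a schedule follows, under one possible reading: the λ learning rate and the actor learning rate share a single value that is interpolated linearly from a starting rate to a final rate over a fixed number of steps. The function name `linear_decay`, its parameters, and the example values are illustrative assumptions; the report does not state the actual rates or the decay horizon.

```python
def linear_decay(lr_init: float, lr_final: float, step: int, total_steps: int) -> float:
    """Linearly interpolate from lr_init (at step 0) to lr_final (at step >= total_steps)."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so the rate stays at lr_final after the horizon.
    frac = min(max(step / total_steps, 0.0), 1.0)
    return lr_init + frac * (lr_final - lr_init)


# Hypothetical values, not taken from the runs in this report.
LR_INIT, LR_FINAL, TOTAL_STEPS = 1e-3, 0.0, 100

for step in (0, 50, 100):
    actor_lr = linear_decay(LR_INIT, LR_FINAL, step, TOTAL_STEPS)
    lambda_lr = actor_lr  # assumption: the λ learning rate mirrors the actor learning rate
    print(step, actor_lr, lambda_lr)
```

Computing the rate once per step and assigning it to both optimizers keeps the two learning rates identical by construction, which is the property the sentence above describes.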
CartPole
[Line charts: AverageTestEpRet, AverageLambda. Run set 0.]
GRN
[Run set 0]
CompGRN
[Run set 0]
FetchReach
[Run set 0]
