descriptiveness

Created on July 16|Last edited on August 10
Comment
study batch size
﻿
objective/kl, ppo/objective/kl
objective/kl, ppo/objective/kl
5001k1.5k2kStep0246810
objective/kl_coef, ppo/objective/kl_coef
objective/kl_coef, ppo/objective/kl_coef
5001k1.5k2kStep0.10.20.30.4
objective/scores, ppo/objective/score, objective/score
objective/scores, ppo/objective/score, objective/score
5001k1.5k2kStep0123
ppo/objective/score_total, objective/score_total
ppo/objective/score_total, objective/score_total
Select runs that logged ppo/objective/score_total 
to visualize data in this line chart.
ppo/ppo/policy/approxkl, ppo/policy/approxkl_avg
ppo/ppo/policy/approxkl, ppo/policy/approxkl_avg
5001k1.5k2kStep0.0010.0020.0030.004
ppo/ppo/policy/clipfrac, ppo/policy/clipfrac_avg
ppo/ppo/policy/clipfrac, ppo/policy/clipfrac_avg
5001k1.5k2kStep00.0020.0040.0060.008
 
batch_size=641
 
batch_size=1281
 
batch_size=2561
 
batch_size=5121
 
batch_size=512, adam 1e-51
oai10
 
batch_size=512, adam 3e-51
 
oai 2
 
batch_size=512, adam 8e-51
 
batch_size=512 sgd1
 
batch_size=512, adam 1e-41
 
batch_size=512, 2e-41
 
batch_size=512 init_kl_coef=0.21
 
batch_size=512 init_kl_coef=0.252
 
batch_size=512, adam 8e-5, init_kl_coef=0.251
 
batch_size=512, adam 4e-5, init_kl_coef=0.31
 
Run set 171
 
bigger KL penalty1
 
Run set 194
 
Run set 2010
 
Run set 2118
 
new adam8
 
tensorflow-style adam10
 
tensorflow-style adam 210
 
tensorflow-style adam gpt2-medium10
 
gpt2-large1
 
gpt210
 
gpt2-xl10
 
gpt2-xl PT10
 
gpt2-medium20
 
gpt2-large20
 
Run set 310
 
gpt2 PT10
Run set 3431
 
gpt2 fix clip val10
 
alt clip val fix10
 
Run set 371
 
gpt2 fix clip val10
 
Run set 3910
Run set 4010
﻿
﻿
	
﻿
Add a comment