Regression Report: wandb
[['?we=costa-huang&wpn=trl&ceik=tracker_project_name&cen=log_with&metrics=env/reward_mean&metrics=env/reward_std&metrics=objective/kl_coef&metrics=objective/kl&metrics=objective/entropy&metrics=ppo/std_scores&metrics=ppo/mean_scores&metrics=ppo/learning_rate&metrics=ppo/mean_non_score_reward&metrics=ppo/loss/value&metrics=ppo/loss/total&metrics=ppo/loss/policy&metrics=ppo/policy/advantages_mean&metrics=ppo/policy/approxkl&metrics=ppo/policy/clipfrac&metrics=ppo/policy/entropy&metrics=ppo/returns/mean&metrics=ppo/returns/var', 'wandb?tag=gpt2-sentiment&cl=sentiment analysis (PR-410)']]
Created on June 8|Last edited on June 8
Comment
Computing group metrics from first 50 groups
Computing group metrics from first 50 groups
Computing group metrics from first 50 groups
Computing group metrics from first 50 groups
Computing group metrics from first 50 groups
Computing group metrics from first 50 groups
sentiment analysis (PR-410)
236
Add a comment