Regression Report: wandb
[['?we=costa-huang&wpn=trl&xaxis=_step&ceik=trl_ppo_trainer_config.value.tracker_project_name&cen=trl_ppo_trainer_config.value.log_with&metrics=env/reward_mean&metrics=objective/kl', 'wandb?tag=gpt2-sentiment&tag=pr-423&cl=sentiment analysis (PR-423)', 'wandb?tag=gpt2-sentiment-1-nminibs&cl=sentiment analysis (no minibatches, target kl=6)']]
Created on June 28|Last edited on June 28
Comment
sentiment analysis (PR-423)
9
sentiment analysis (no minibatches, target kl=6)
10
Add a comment