Reports
Created by
Created On
Last edited
An Example of Transformer Reinforcement Learning
In this article, we take a look at the logged metrics and gradients from a GPT-2 experiment that is tasked with writing favorable reviews for movies.
10
2020-05-13
carey
lavanyashukla