Skip to main content

Filter higher quality pretraining data.

Created on April 15|Last edited on April 15
Instead of pretraining on all the observation data we had, I restricted to the smaller (~1K) trials of Will's data that did have >0.5 R2. This did not help downstream performance.