Reports
Created by
Created On
Last edited
trlx: Set add_special_tokens=False to not add EOS unexpectedly #287
Unable to directly repro T5 results
0
2023-02-10
trlx: refactor - remove orchestrator abstraction from API #289
ILQL sentiments reproduction from `main`
0
2023-02-08
trlx: refactor - remove orchestrator abstraction from API #289
* PPO sentiments reproduction from `main` (gpt2)
* Summarize dailymail/cnn reproduction from `main` (google/flan-t5-small)`
0
2023-02-08
trlx: Add `bitsandbytes` optimizer support #133
Report for the following PR: https://github.com/CarperAI/trlx/pull/133
0
2023-01-04
trlx: Add `LORA` support #110
Report for the following PR: https://github.com/CarperAI/trlx/pull/110
0
2023-01-11
trlx: LORA support
Results for the ILQL Sentiment task with LORA to observe training dynamics and memory saving.
0
2023-01-07
trlx: LORA support
Results for the PPO Sentiment task with LORA to observe training dynamics and memory saving.
0
2023-01-06
trlx: Add OptimizerConfig and SchedulerConfig #135
Results for PPO/ILQL sentiments examples to demonstrate no regressions.
0
2022-12-14
trlx: `accelerate` Multi-Node DDP Benchmark
PPO Sentiments Benchmark on Multi-Node DDP setup
0
2022-12-07
Hydra GPT-J PPO Sentiment
Hydra GPT-J with all but one layer unfrozen in the base backbone
0
2022-12-06