Reports
Created by
Created On
Last edited
0
2023-03-17
0
2023-02-10
0
2023-02-08
trlx: refactor - remove orchestrator abstraction from API #289
* PPO sentiments reproduction from `main` (gpt2)
* Summarize dailymail/cnn reproduction from `main` (google/flan-t5-small)
0
2023-02-08
trlx: Add `bitsandbytes` optimizer support #133
Report for the following PR: https://github.com/CarperAI/trlx/pull/133
0
2023-01-04
trlx: Add `LORA` support #110
Report for the following PR: https://github.com/CarperAI/trlx/pull/110
0
2023-01-11
0
2023-01-10
trlx: LORA support
Results for the ILQL Sentiment task with LoRA, to observe training dynamics and memory savings.
0
2023-01-07
trlx: LORA support
Results for the PPO Sentiment task with LoRA, to observe training dynamics and memory savings.
0
2023-01-06
0
2022-12-17
trlx: Add OptimizerConfig and SchedulerConfig #135
Results for the PPO/ILQL sentiments examples, demonstrating no regressions.
0
2022-12-14
trlx: `accelerate` Multi-Node DDP Benchmark
PPO Sentiments benchmark on a multi-node DDP setup.
0
2022-12-07
Hydra GPT-J PPO Sentiment
Hydra GPT-J with all but one layer unfrozen in the base backbone
0
2022-12-06
GPT-J PPO Sentiment
Hydra GPT-J with all but one layer unfrozen in the base backbone
0
2022-12-06
trlx: GPT-Neo PPO Sentiment
ILQL sentiment results for the `Add GPTNeo support` PR.
0
2022-12-06
trlx: GPT-Neo PPO Sentiment
PPO sentiment results for the `Add GPTNeo support` PR.
0
2022-12-06
trlx: ILQL Sentiment Benchmark
ILQL Benchmark results for the wider Causal LM support PR.
0
2022-12-05
trlx: PPO Sentiment Benchmark
PPO Benchmark results for the wider Causal LM support PR.
0
2022-12-05
0
2022-12-04
0
2022-12-01