Reports
Created by
Created On
Last edited
BoN v. main
BoN @9680c9f/Merge branch 'main' into bon-x/2023-09-01 main @bf0fe38/fix(examples/hh): old gpt-j checkpoint loading (#552)/2023-09-01
0
2023-09-01
BoN v. main
BoN @a022d3f/Merge branch 'main' into BoN/2023-09-01 main @a7eb066/chore(requirements.txt): update everything to the latest (#547)/2023-08-31
0
2023-09-01
update-requirements v. main
update-requirements @bfb53f6/chore(requirements.txt): update everything to the latest/2023-08-21 main @900040a/mpt support (#546)/2023-08-21
0
2023-08-21
BoN v. main
BoN @36f06af/Remove prints/2023-08-07 main @2e667e6/fix(modeling): deepspeed checkpoint loading (#482)/2023-08-08
0
2023-08-11
value-branch v. main
value-branch @d48c1a4/Fix: Include num_value_layers_frozen arg in Seq2Seq model init/2023-07-26 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-29
value-branch v. main
value-branch @d48c1a4/Fix: Include num_value_layers_frozen arg in Seq2Seq model init/2023-07-26 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-29
value-branch v. main
value-branch @d48c1a4/Fix: Include num_value_layers_frozen arg in Seq2Seq model init/2023-07-26 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-29
value-branch v. main
value-branch @a875c4a/Fixing style/2023-07-18 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-25
value-branch v. main
value-branch @a875c4a/Fixing style/2023-07-18 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-25
value-branch v. main
value-branch @a875c4a/Fixing style/2023-07-18 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-25
value-branch v. main
value-branch @a875c4a/Fixing style/2023-07-18 main @e36fe9d/fix(modeling_ppo): load reference head under zero3 (#489)/2023-07-24
0
2023-07-24
fix-logging v. main
fix-logging @7e4374e/Fix logging/2023-07-05 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-24
fix-logging v. main
fix-logging @7e4374e/Fix logging/2023-07-05 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-24
fix-logging v. main
fix-logging @7e4374e/Fix logging/2023-07-05 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
BoN v. main
BoN @1fda0ce/fix(accelerate_base_trainer): order of keyword arguments/2023-07-22 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
main v. main
main @ea7c2b0/revert(accelerate_ilql_trainer): remove `shuffle` option/2023-07-22 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
BoN v. main
BoN @1fda0ce/fix(accelerate_base_trainer): order of keyword arguments/2023-07-22 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
BoN v. main
BoN @3d7e0d5/Fix chunked generation/2023-07-21 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
main v. main
main @10369a1/fix(accelerate_trainer): fill a missing `shuffle` argument/2023-07-22 main @5d0f04d/Update README.md (#537)/2023-07-21
0
2023-07-22
main v. main
main @8a943d9/Reset train dataloder at each iteration/2023-07-19 main @288d4cb/Fix: rename model_tok to tokenizer is reward_fn arg (#534)/2023-07-20
0
2023-07-20
dense-reward-carper v. main
dense-reward-carper @6bc1c54/fix(ppo_randomwalks): `reward_fn` signature to accommodate tokenizer/2023-07-17 main @5c5abca/feat(readme): add instructions to avoid OOMs with hyperparameters (#470)/2023-07-13
0
2023-07-17
main v. main
main @fe33681/Fix ordering of ppo epoch iteration/2023-07-13 main @1446523/Update README.md/2023-07-10
0
2023-07-13
dense-reward-carper v. main
dense-reward-carper @442423f/Fix missing dtype in trainer rewards tensor (#520)/2023-07-10 main @b9c16f5/fix peft config cause tensorboard type error (#515)/2023-07-05
0
2023-07-10
dev_additional_special_tokens v. main
dev_additional_special_tokens @d2c4b3b/fix(accelerate_ppo_trainer): resize token embeddings without hydra/2023-06-26 main @475a284/Release: v0.7.0 (#511)/2023-06-23
0
2023-06-26
dev_additional_special_tokens v. main
dev_additional_special_tokens @12560ac/Merge branch 'main' into dev_additional_special_tokens/2023-06-26 main @475a284/Release: v0.7.0 (#511)/2023-06-23
0
2023-06-26
quantization v. main
quantization @2b2c546/ 8-bit inference (#512)/2023-06-24 main @475a284/Release: v0.7.0 (#511)/2023-06-23
0
2023-06-26
bigcode v. main
bigcode @a7e3b9c/feat(modeling_ppo): add GPTBigCodeModelBranch/2023-06-19 main @475a284/Release: v0.7.0 (#511)/2023-06-23
0
2023-06-26
dev_additional_special_tokens v. main
dev_additional_special_tokens @134bbf9/fix: move hydra heads resize_token_embeddings/2023-06-14 main @475a284/Release: v0.7.0 (#511)/2023-06-23
0
2023-06-26
fix-default-gen_kwargs v. main
fix-default-gen_kwargs @087e73d/fix(base_trainer): force pad_token regardless of architecture/2023-06-23 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-23
fix-logratio-bug v. main
fix-logratio-bug @a451c94/Shift mask for seq2seq models + remove debug print statements/2023-06-22 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-22
fix-logratio-bug v. main
fix-logratio-bug @a451c94/Shift mask for seq2seq models + remove debug print statements/2023-06-22 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-22
peft_migration v. main
peft_migration @de52db0/ILQL generate when temperature is 0./2023-06-08 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-22
peft_migration v. main
peft_migration @de52db0/ILQL generate when temperature is 0./2023-06-08 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-22
fix-ppo-zero3 v. main
fix-ppo-zero3 @717d81b/style: satisfy black/2023-06-05 main @093e89a/Change logging_dir to project_dir (#506)/2023-06-17
0
2023-06-16
fix-drop-last-in-dataloader v. main
fix-drop-last-in-dataloader @3cd05ed/style: satisfy flake/2023-05-10 main @9fdd0d7/feat(requirements.txt): upgrade dependencies (#465)/2023-05-01
0
2023-05-10
update-requirements v. main
update-requirements @3291d2e/fix(requirements.txt): relax dependencies/2023-05-01 main @a47611d/docs(base_trainer): fill in missing `prepare_learning` method (#449)/2023-04-30
0
2023-05-01
update-requirements v. main
update-requirements @8e64ad8/chore(requirements.txt): upgrade/2023-04-30 main @a47611d/docs(base_trainer): fill in missing `prepare_learning` method (#449)/2023-04-30
0
2023-04-30
main v. main
main @8540322/fix(ppo_modeling): condition passing `device` in `OPTModelBranch`/2023-04-25 main @07c962e/Create Example training scripts to run in Stability cluster (#419)/2023-04-24
0
2023-04-25
fix-kl-computation v. main
fix-kl-computation @c37aa8b/feat(ppo_trainer): log token-wise KL/2023-04-20 main @9bc0836/fix(offline_pipeline): ILQL negative indexing under truncation (#435)/2023-04-18
0
2023-04-20
fix-kl-computation v. main
fix-kl-computation @0920350/style: satisfy black/2023-04-19 main @9bc0836/fix(offline_pipeline): ILQL negative indexing under truncation (#435)/2023-04-18
0
2023-04-19
fix-save-pretrained-zero3 v. main
fix-save-pretrained-zero3 @d94aac1/style/2023-04-10 main @a66a7da/[feat] Support tying metadata to each prompt (#421)/2023-04-10
0
2023-04-10
add-prompts-metadata v. main
add-prompts-metadata @8f11cca/style: satisfy flake/2023-04-06 main @565c316/Minibatch impl (#364)/2023-04-06
0
2023-04-07
add-prompts-metadata v. main
add-prompts-metadata @8f11cca/style: satisfy flake/2023-04-06 main @565c316/Minibatch impl (#364)/2023-04-06
0
2023-04-06
add_eos_token_when_padded v. main
add_eos_token_when_padded @f219302/Ensure trailing EOS token is added also when the generated output contains trailing padding as is often the case when doing batch generation/2023-04-05 main @d7225c2/set torch dependency to version 2.0.0 for CUDA in installation instructions (#409)/2023-04-05
0
2023-04-05
ray-train-integration v. main
ray-train-integration @ec638c3/style: satisfy black/2023-03-31 main @c77e304/Nemo ILQL fixes (#404)/2023-03-30
0
2023-03-31
fix-preserve-eos-token v. main
fix-preserve-eos-token @de3b04f/fix(ppo_trainer): preserve `<eos>` and append it after trimming/2023-03-28 main @114998b/[feat] Add benchmark tools (#357)/2023-03-28
0
2023-03-28
add-benchmark-tools v. main
add-benchmark-tools @fb8cdb7/chore(README): add a link to reference runs/2023-03-26 main @086a905/[feat] Add LLaMa Model support for PPO (#375)/2023-03-27
0
2023-03-27
add-hypothesis v. main
add-hypothesis @0390597/Merge branch 'main' of github.com:CarperAI/trlx into add-hypothesis/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-23
add-hypothesis v. main
add-hypothesis @0390597/Merge branch 'main' of github.com:CarperAI/trlx into add-hypothesis/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-23
add-hypothesis v. main
add-hypothesis @0390597/Merge branch 'main' of github.com:CarperAI/trlx into add-hypothesis/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-22
add-benchmark-tools v. main
add-benchmark-tools @711fb4d/fix(benchmark): ignore git apply patch failed error/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-22
add-benchmark-tools v. main
add-benchmark-tools @711fb4d/fix(benchmark): ignore git apply patch failed error/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-22
add-benchmark-tools v. main
add-benchmark-tools @711fb4d/fix(benchmark): ignore git apply patch failed error/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-22
add-benchmark-tools v. main
add-benchmark-tools @711fb4d/fix(benchmark): ignore git apply patch failed error/2023-03-22 main @b0c4ea9/feat: add requirements.txt from the most recent `pip freeze` (#381)/2023-03-21
0
2023-03-22
half-exp-fix v. main
half-exp-fix @4b095df/Merge branch 'main' into half-exp-fix/2023-03-20 main @79bfc6b/fix(docs): Update incorrect `PPORLElement` logprob tensor shape hint (#377)/2023-03-17
0
2023-03-20
half-exp-fix v. main
half-exp-fix @1a243ae/fixes half exp not implemented error/2023-03-13 main @79bfc6b/fix(docs): Update incorrect `PPORLElement` logprob tensor shape hint (#377)/2023-03-17
0
2023-03-20
fix-total-steps-distributed v. main
fix-total-steps-distributed @c883b0f/fix(ilql_trainer): `total_steps` calculation if running distributed/2023-03-16 main @4623105/[fix] Drop `<eos>` from ILQL sample's phrases (#362)/2023-03-14
0
2023-03-16
fix-total-steps-distributed v. main
fix-total-steps-distributed @c883b0f/fix(ilql_trainer): `total_steps` calculation if running distributed/2023-03-16 main @4623105/[fix] Drop `<eos>` from ILQL sample's phrases (#362)/2023-03-14
0
2023-03-16
ray-train-integration v. main
ray-train-integration @c7ac679/fix(setup.cfg): condition ray's pinned wheel/2023-03-15 main @4623105/[fix] Drop `<eos>` from ILQL sample's phrases (#362)/2023-03-14
0
2023-03-15
ray-train-integration v. main
ray-train-integration @a7e7bb4/merge: revert to upstream changes/2023-03-14 main @4623105/[fix] Drop `<eos>` from ILQL sample's phrases (#362)/2023-03-14
0
2023-03-14
infinite_prompt_pipeline v. main
infinite_prompt_pipeline @a50cc07/Adding `infinite_dataloader` since previous solution doesn't seem to work as expected/2023-03-07 main @ded2e5e/fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361)/2023-03-13
0
2023-03-13
half-exp-fix v. main
half-exp-fix @1a243ae/fixes half exp not implemented error/2023-03-13 main @ded2e5e/fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361)/2023-03-13
0
2023-03-13
fix-ilql-sample-endings v. main
fix-ilql-sample-endings @20eec76/fix(ilql_randomwalks): bump `seq_length` to not trim any samples/2023-03-10 main @ded2e5e/fix(ppo_trainer): update `AdaptiveKLController` with correct KL (#361)/2023-03-13
0
2023-03-13
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-13
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-13
fix-ilql-sample-endings v. main
fix-ilql-sample-endings @20eec76/fix(ilql_randomwalks): bump `seq_length` to not trim any samples/2023-03-10 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-10
fix-ilql-sample-endings v. main
fix-ilql-sample-endings @20eec76/fix(ilql_randomwalks): bump `seq_length` to not trim any samples/2023-03-10 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-10
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-10
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-10
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
fix-kl-controller @ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 main @adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 v. adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 v. adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 v. adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 v. adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09
fix-kl-controller v. main
ee8cd4a/fix(ppo_trainer): update `AdaptiveKLController` with correct KL/2023-03-08 v. adbf8fc/Add intermediate checkpointing to `accelerate` trainers (#349)/2023-03-08
0
2023-03-09