ColossalAI/applications/ColossalChat/examples/training_scripts
Latest commit df612434c9 by pre-commit-ci[bot], 2024-06-14 16:27:46 +08:00: [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)
File          Last commit                                           Date
hostfile      [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_dpo.py  [pre-commit.ci] auto fixes from pre-commit.com hooks  2024-06-14 16:27:46 +08:00
train_dpo.sh  [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_ppo.py  [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_ppo.sh  [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_rm.py   [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_rm.sh   [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_sft.py  [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00
train_sft.sh  [ColossalChat] Update RLHF V2 (#5286)                 2024-03-29 14:12:29 +08:00