ColossalAI/applications/ColossalChat/examples/training_scripts
YeAnbang f3de5a025c remove debug code 2024-06-24 05:16:29 +00:00
..
hostfile add SimPO 2024-06-24 02:12:20 +00:00
train_dpo.py add SimPO 2024-06-24 02:12:20 +00:00
train_dpo.sh fix dataloader 2024-06-24 05:10:44 +00:00
train_ppo.py replace the customized dataloader setup with the build-in one 2024-06-07 09:43:42 +00:00
train_ppo.sh [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
train_rm.py replace the customized dataloader setup with the build-in one 2024-06-07 09:43:42 +00:00
train_rm.sh [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
train_sft.py add SimPO 2024-06-24 02:12:20 +00:00
train_sft.sh remove debug code 2024-06-24 05:16:29 +00:00