You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/applications/ColossalChat/examples/training_scripts
YeAnbang 09d5ffca1a
add kto
4 months ago
..
hostfile add SimPO 5 months ago
train_dpo.py fix orpo cross entropy loss 5 months ago
train_dpo.sh add orpo 5 months ago
train_kto.py add kto 4 months ago
train_kto.sh add kto 4 months ago
train_orpo.py fix orpo cross entropy loss 5 months ago
train_orpo.sh add orpo 5 months ago
train_ppo.py replace the customized dataloader setup with the build-in one 6 months ago
train_ppo.sh [ColossalChat] Update RLHF V2 (#5286) 8 months ago
train_rm.py fix orpo cross entropy loss 5 months ago
train_rm.sh add kto 4 months ago
train_sft.py fix orpo cross entropy loss 5 months ago
train_sft.sh add kto 4 months ago