ColossalAI/applications/ColossalChat/coati/trainer
YeAnbang 544b7a38a1 fix style, add kto data sample 2024-07-18 08:38:56 +00:00
..
callbacks [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py add kto 2024-07-18 07:54:11 +00:00
base.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
dpo.py fix eval 2024-07-11 03:35:03 +00:00
kto.py fix style, add kto data sample 2024-07-18 08:38:56 +00:00
orpo.py fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
ppo.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
rm.py fix eval 2024-07-11 03:35:03 +00:00
sft.py fix eval 2024-07-11 03:35:03 +00:00
utils.py [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00