ColossalAI/applications/ColossalChat/coati/trainer
YeAnbang 0b2d55c4ab Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
callbacks [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py add kto 2024-07-18 07:54:11 +00:00
base.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
dpo.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
kto.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
orpo.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
ppo.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
rm.py fix eval 2024-07-11 03:35:03 +00:00
sft.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
utils.py [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00