ColossalAI/applications/ColossalChat/coati/trainer
wangbluo 4cf79fa275 merge 2024-08-17 09:34:18 +00:00
callbacks [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py [FP8] rebase main (#5963) 2024-08-06 16:29:37 +08:00
base.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
dpo.py merge 2024-08-17 09:34:18 +00:00
kto.py merge 2024-08-17 09:34:18 +00:00
orpo.py merge 2024-08-17 09:34:18 +00:00
ppo.py Support overall loss, update KTO logging 2024-08-02 06:51:38 +00:00
rm.py [FP8] rebase main (#5963) 2024-08-06 16:29:37 +08:00
sft.py [FP8] rebase main (#5963) 2024-08-06 16:29:37 +08:00
utils.py [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00