ColossalAI/applications/ColossalChat/coati/trainer
Latest commit: Tong Li, 4c8e85ee0d, [Coati] Train DPO using PP (#6054), 2 months ago
callbacks
__init__.py
base.py [ColossalChat] Add PP support (#6001) 3 months ago
dpo.py [Coati] Train DPO using PP (#6054) 2 months ago
kto.py [ColossalChat] Add PP support (#6001) 3 months ago
orpo.py [ColossalChat] Add PP support (#6001) 3 months ago
ppo.py Support overall loss, update KTO logging 4 months ago
rm.py [ColossalChat] Add PP support (#6001) 3 months ago
sft.py [ColossalChat] Add PP support (#6001) 3 months ago
utils.py [ColossalChat] Add PP support (#6001) 3 months ago