ColossalAI/applications/Chat/coati/trainer
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2023-04-17 16:47:44 +08:00
callbacks [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
strategies fix save_model in naive and ddp strategy (#3436) 2023-04-04 15:30:01 +08:00
__init__.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
base.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
ppo.py [chat]: add vf_coef argument for PPOTrainer (#3318) 2023-04-11 09:54:59 +08:00
rm.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
sft.py fix: fix sft (#3568) 2023-04-17 16:47:44 +08:00
utils.py [chatgpt] Detached PPO Training (#3195) 2023-04-17 14:46:50 +08:00