ColossalAI/applications/Chat/coati/trainer
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2023-04-17 16:47:44 +08:00
callbacks
strategies fix save_model in naive and ddp strategy (#3436) 2023-04-04 15:30:01 +08:00
__init__.py
base.py
ppo.py [chat]: add vf_coef argument for PPOTrainer (#3318) 2023-04-11 09:54:59 +08:00
rm.py
sft.py fix: fix sft (#3568) 2023-04-17 16:47:44 +08:00
utils.py [chatgpt] Detached PPO Training (#3195) 2023-04-17 14:46:50 +08:00