ColossalAI/applications/Chat/coati/trainer/strategies
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader
2023-06-29 10:48:09 +08:00
..
__init__.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
base.py [chat] refactor strategy class with booster api (#3987) 2023-06-25 17:36:21 +08:00
colossalai.py [chat] refactor trainer class (#4080) 2023-06-29 10:48:09 +08:00
ddp.py [chat] refactor strategy class with booster api (#3987) 2023-06-25 17:36:21 +08:00
naive.py [chat] refactor strategy class with booster api (#3987) 2023-06-25 17:36:21 +08:00
sampler.py [chat] add distributed PPO trainer (#3740) 2023-06-07 10:41:16 +08:00