ColossalAI/applications/Chat/coati
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader
2023-06-29 10:48:09 +08:00
..
dataset [chat] refactor actor class (#3968) 2023-06-13 13:31:56 +08:00
experience_maker [chat] refactor actor class (#3968) 2023-06-13 13:31:56 +08:00
kernels [CI] fix some spelling errors (#3707) 2023-05-10 17:12:03 +08:00
models [chat] refactor actor class (#3968) 2023-06-13 13:31:56 +08:00
quant [chat] add distributed PPO trainer (#3740) 2023-06-07 10:41:16 +08:00
ray fix typo applications/Chat/coati/ (#3947) 2023-06-15 10:43:11 +08:00
replay_buffer [chat] polish code note typo (#3612) 2023-04-20 17:22:15 +08:00
trainer [chat] refactor trainer class (#4080) 2023-06-29 10:48:09 +08:00
utils [chat] remove lm model class (#3653) 2023-04-27 15:37:38 +08:00
__init__.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00