ColossalAI

History

Wenhao Chen b03d64d010 [chat] refactor trainer class (#4080 ) * to: add SLTrainer * refactor: refactor RMTrainer and SFTTrainer * fix: fix init file * feat: remove on_learn_epoch fn as not used * fix: align with modified gemini arguments * to: add OnPolicyTrainer * revert: add _on_learn_epoch fn * refactor: refactor PPOTrainer * style: rename PPOTrainer argument * fix: align with modified PPO arguments * test: align with modified train_prompts arguments * chore: modify train_prompts * docs: align with modified arguments * fix: remove unnecessary output * fix: move dataloader to fit fn of SLTrainer * fix: move dataloader to fit fn of OnPolicyTrainer * fix: modify usage of prompt and pretrain dataloader		2023-06-29 10:48:09 +08:00
..
dataset	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
experience_maker	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
kernels	[CI] fix some spelling errors (#3707 )	2023-05-10 17:12:03 +08:00
models	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
quant	[chat] add distributed PPO trainer (#3740 )	2023-06-07 10:41:16 +08:00
ray	fix typo applications/Chat/coati/ (#3947 )	2023-06-15 10:43:11 +08:00
replay_buffer	[chat] polish code note typo (#3612 )	2023-04-20 17:22:15 +08:00
trainer	[chat] refactor trainer class (#4080 )	2023-06-29 10:48:09 +08:00
utils	[chat] remove lm model class (#3653 )	2023-04-27 15:37:38 +08:00
__init__.py	[Coati] first commit (#3283 )	2023-03-28 20:25:36 +08:00