ColossalAI

History

Wenhao Chen 153b957a1b [chat] refactor strategy class with booster api (#3987 ) * refactor: adapt boost API in base and naive strategies * fix: initialize plugin after setup_distributed * fix: fix save_pretrained fn * refactor: adapt boost API in DDPStrategy * to: add _post_init check * to: fix ddp backward, modify ddp dataloader and unwrap * feat: adapt boost API in ColossalAIStrategy * fix: call setup_distributed before use get_current_device * fix: fix save_model and save_optimizer * test: remove save_sharded_optimizer test * style: apply formatter * fix: fix stage check and add comments * feat: allow dict type arg in strategy.prepare * to: temporarily remove lr_scheduler for testing * style: simplify init of ColossalAIStrategy * fix: fix lr_scheduler in sft and rm * style: modify comments * test: add train_prompts tests * fix: fix inference only case and use in train_prompts * test: skip failed tests in ci * style: fix CodeFactor check * fix: do not use model.to('cpu') with GeminiPlugin * test: enable colossalai_gemini tests * test: set CUDA_VISIBLE_DEVICES in ci * docs: add note		2023-06-25 17:36:21 +08:00
..
dataset	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
experience_maker	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
kernels	[CI] fix some spelling errors (#3707 )	2023-05-10 17:12:03 +08:00
models	[chat] refactor actor class (#3968 )	2023-06-13 13:31:56 +08:00
quant	[chat] add distributed PPO trainer (#3740 )	2023-06-07 10:41:16 +08:00
ray	fix typo applications/Chat/coati/ (#3947 )	2023-06-15 10:43:11 +08:00
replay_buffer	[chat] polish code note typo (#3612 )	2023-04-20 17:22:15 +08:00
trainer	[chat] refactor strategy class with booster api (#3987 )	2023-06-25 17:36:21 +08:00
utils	[chat] remove lm model class (#3653 )	2023-04-27 15:37:38 +08:00
__init__.py	[Coati] first commit (#3283 )	2023-03-28 20:25:36 +08:00