ColossalAI

Commit Graph

Author	SHA1	Message	Date
Wenhao Chen	3d8d5d0d58	[chat] use official transformers and fix some issues (#4117 ) * feat: remove on_learn_epoch fn as not used * revert: add _on_learn_epoch fn * feat: remove NaiveStrategy * test: update train_prompts tests * fix: remove prepare_llama_tokenizer_and_embedding * test: add lora arg * feat: remove roberta support in train_prompts due to runtime errs * feat: remove deberta & roberta in rm as not used * test: remove deberta and roberta tests * feat: remove deberta and roberta models as not used * fix: remove calls to roberta * fix: remove prepare_llama_tokenizer_and_embedding * chore: update transformers version * docs: update transformers version * fix: fix actor inference * fix: fix ci * feat: change llama pad token to unk * revert: revert ddp setup_distributed * fix: change llama pad token to unk * revert: undo unnecessary changes * fix: use pip to install transformers	1 year ago
Wenhao Chen	edd75a59ea	[chat] remove naive strategy and split colossalai strategy (#4094 ) * feat: remove on_learn_epoch fn as not used * revert: add _on_learn_epoch fn * to: remove the use of NaiveStrategy * test: remove NaiveStrategy tests * feat: remove NaiveStrategy * style: modify comments and params * feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy * fix: remove naive * fix: align with modified colossal strategy * fix: fix ddp _try_init_dist arg	1 year ago
Wenhao Chen	b03d64d010	[chat] refactor trainer class (#4080 ) * to: add SLTrainer * refactor: refactor RMTrainer and SFTTrainer * fix: fix init file * feat: remove on_learn_epoch fn as not used * fix: align with modified gemini arguments * to: add OnPolicyTrainer * revert: add _on_learn_epoch fn * refactor: refactor PPOTrainer * style: rename PPOTrainer argument * fix: align with modified PPO arguments * test: align with modified train_prompts arguments * chore: modify train_prompts * docs: align with modified arguments * fix: remove unnecessary output * fix: move dataloader to fit fn of SLTrainer * fix: move dataloader to fit fn of OnPolicyTrainer * fix: modify usage of prompt and pretrain dataloader	1 year ago
github-actions[bot]	62c7e67f9f	[format] applied code formatting on changed files in pull request 3786 (#3787 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
binmakeswell	ad2cf58f50	[chat] add performance and tutorial (#3786 )	2 years ago
digger-yu	6650daeb0a	[doc] fix chat spelling error (#3671 ) * Update README.md change "huggingaface" to "huggingface" * Update README.md change "Colossa-AI" to "Colossal-AI"	2 years ago
tanitna	1a60dc07a8	[chat] typo accimulation_steps -> accumulation_steps (#3662 )	2 years ago
Hongxin Liu	842768a174	[chat] refactor model save/load logic (#3654 ) * [chat] strategy refactor unwrap model * [chat] strategy refactor save model * [chat] add docstr * [chat] refactor trainer save model * [chat] fix strategy typing * [chat] refactor trainer save model * [chat] update readme * [chat] fix unit test	2 years ago
Yuanchen	c4709d34cf	Chat evaluate (#3608 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2 years ago
binmakeswell	5a79cffdfd	[coati] fix install cmd (#3592 )	2 years ago
binmakeswell	535b896435	[chat] polish tutorial doc (#3551 ) * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial	2 years ago
Yuanchen	7182ac2a04	[chat]add examples of training with limited resources in chat readme (#3536 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2 years ago
Andrew	82132f4e3d	[chat] correcting a few obvious typos and grammars errors (#3338 )	2 years ago
Fazzie-Maqianli	0fbadce79c	[doc] added authors to the chat application (#3307 )	2 years ago
BlueRum	b512893637	Polish readme link (#3306 )	2 years ago
binmakeswell	31c78f2be3	[doc] add ColossalChat news (#3304 ) * [doc] add ColossalChat news * [doc] add ColossalChat news	2 years ago
Frank Lee	e235a24673	[application] updated the README (#3301 ) * [application] updated the README * polish code	2 years ago
github-actions[bot]	5134ad5d1a	[format] applied code formatting on changed files in pull request 3296 (#3298 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
BlueRum	c8b723d6c2	[chat]Update Readme (#3296 ) * Update README.md * Update README.md * Update README.md * update example readme	2 years ago
Fazzie-Maqianli	a88ed0f83a	add limit (#3293 )	2 years ago
Fazzie-Maqianli	c5484281aa	[ColossalChat]add cite for datasets (#3292 )	2 years ago
Fazzie-Maqianli	ec7af22a43	fix image (#3288 )	2 years ago
Fazzie-Maqianli	1f7d9afbf8	add example (#3286 )	2 years ago
Fazzie-Maqianli	b0ce5a1032	[Coati] first commit (#3283 )	2 years ago

24 Commits (8af29ee47a274b7bc416a41732bd59208bc3d92c)