ColossalAI

Commit Graph

Author	SHA1	Message	Date
MisterLin1995	f7361ee1bd	[chat] fix community example ray (#3719 ) Co-authored-by: jiangwen <zxl265370@antgroup.com>	2023-05-10 13:36:09 +08:00
zhang-yi-chi	2da5d81dec	[chat] fix train_prompts.py gemini strategy bug (#3666 ) * fix gemini strategy bug * add comment * add comment * better solution	2023-05-06 16:46:38 +08:00
digger-yu	65bdc3159f	fix some spelling error with applications/Chat/examples/ (#3692 ) * fix spelling error with examples/comminity/ * fix spelling error with example/	2023-05-06 11:27:23 +08:00
Camille Zhong	0f785cb1f3	[chat] PPO stage3 doc enhancement (#3679 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * Update README.md Update README.md * update readme * Update test_ci.sh * update readme and add a script update readme and add a script modify readme Update README.md	2023-05-05 13:36:56 +08:00
digger-yu	6650daeb0a	[doc] fix chat spelling error (#3671 ) * Update README.md change "huggingaface" to "huggingface" * Update README.md change "Colossa-AI" to "Colossal-AI"	2023-05-05 11:37:35 +08:00
tanitna	1a60dc07a8	[chat] typo accimulation_steps -> accumulation_steps (#3662 )	2023-04-28 15:42:57 +08:00
binmakeswell	268b3cd80d	[chat] set default zero2 strategy (#3667 ) * [chat] set default gemini strategy * [chat] set default zero2 strategy * [chat] set default zero2 strategy	2023-04-28 13:56:50 +08:00
Hongxin Liu	842768a174	[chat] refactor model save/load logic (#3654 ) * [chat] strategy refactor unwrap model * [chat] strategy refactor save model * [chat] add docstr * [chat] refactor trainer save model * [chat] fix strategy typing * [chat] refactor trainer save model * [chat] update readme * [chat] fix unit test	2023-04-27 18:41:49 +08:00
Hongxin Liu	6ef7011462	[chat] remove lm model class (#3653 ) * [chat] refactor lora * [chat] remove lm class * [chat] refactor save model * [chat] refactor train sft * [chat] fix ci * [chat] fix ci	2023-04-27 15:37:38 +08:00
Camille Zhong	8bccb72c8d	[Doc] enhancement on README.md for chat examples (#3646 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * Update README.md Update README.md * update readme * Update test_ci.sh	2023-04-27 14:26:19 +08:00
Hongxin Liu	2a951955ad	[chat] refactor trainer (#3648 ) * [chat] ppo trainer remove useless args * [chat] update examples * [chat] update benchmark * [chat] update examples * [chat] fix sft training with wandb * [chat] polish docstr	2023-04-26 18:11:49 +08:00
zhang-yi-chi	739cfe3360	[chat] fix enable single gpu training bug	2023-04-22 14:16:08 +08:00
digger-yu	d7bf284706	[chat] polish code note typo (#3612 )	2023-04-20 17:22:15 +08:00
Yuanchen	1ec0d386a9	reconstruct chat trainer and fix training script (#3588 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-18 16:44:03 +08:00
Camille Zhong	36a519b49f	Update test_ci.sh update Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh update Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml update ci Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh Update test_ci.sh update test ci RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. [test]chat_update_ci Update test_ci.sh Update test_ci.sh test Update gpt_critic.py Update gpt_critic.py Update run_chatgpt_unit_tests.yml update test ci update update update update Update test_ci.sh update Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml	2023-04-18 14:33:12 +08:00
tingfeng cao	7788e0b0a5	fix: fix sft (#3568 )	2023-04-17 16:47:44 +08:00
Fazzie-Maqianli	6b1a39b17b	[coati] add costom model suppor tguide (#3579 )	2023-04-17 15:40:41 +08:00
binmakeswell	cc1eec2f53	[chat] update reward model sh (#3578 )	2023-04-17 15:02:55 +08:00
csric	e355144375	[chatgpt] Detached PPO Training (#3195 ) * run the base * working on dist ppo * sync * detached trainer * update detached trainer. no maker update function * facing init problem * 1 maker 1 trainer detached run. but no model update * facing cuda problem * fix save functions * verified maker update * nothing * add ignore * analyize loss issue * remove some debug codes * facing 2m1t stuck issue * 2m1t verified * do not use torchrun * working on 2m2t * working on 2m2t * initialize strategy in ray actor env * facing actor's init order issue * facing ddp model update issue (need unwarp ddp) * unwrap ddp actor * checking 1m2t stuck problem * nothing * set timeout for trainer choosing. It solves the stuck problem! * delete some debug output * rename to sync with upstream * rename to sync with upstream * coati rename * nothing * I am going to detach the replaybuffer from trainer and make it a Ray Actor. Two benefits: 1. support TP trainer. 2. asynchronized buffer operations * experience_maker_holder performs target-revolving _send_experience() instead of length comparison. * move code to ray subfolder * working on pipeline inference * apply comments --------- Co-authored-by: csric <richcsr256@gmail.com>	2023-04-17 14:46:50 +08:00
MisterLin1995	1a809eddaa	[chat] ChatGPT train prompts on ray example (#3309 ) * [feat][chatgpt]train prompts on ray example * [fix]simplify code * [fix]remove depreciated parameter * [fix]add dependencies * [fix]method calling * [fix]experience maker * [fix]missing loss function * [fix]init optimizer * [feat]add usage comment * [fix]rename files * [fix]add readme * [fix]file path * [fix]move directory --------- Co-authored-by: jiangwen <zxl265370@antgroup.com>	2023-04-13 18:18:36 +08:00
ver217	89fd10a1c9	[chat] add zero2 cpu strategy for sft training (#3520 )	2023-04-10 19:00:13 +08:00
NatalieC323	635d0a1baf	[Chat Community] Update README.md (fixed#3487) (#3506 ) * Update README.md * Update README.md * Update README.md * Update README.md --------- Co-authored-by: Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>	2023-04-10 14:36:39 +08:00
binmakeswell	891b8e7fac	[chat] fix stage3 PPO sample sh command (#3477 )	2023-04-06 18:08:16 +08:00
Fazzie-Maqianli	6afeb1202a	add community example dictionary (#3465 )	2023-04-06 15:04:48 +08:00
YY Lin	62f4e2eb07	[Chat]Add Peft support & fix the ptx bug (#3433 ) * Update ppo.py Fix the bug of fetching wrong batch data * Add peft model support in SFT and Prompts training In stage-1 and stage-3, the peft model supports are added. So the trained artifacts will be only a small lora additions instead of the whole bunch of files. * Delete test_prompts.txt * Delete test_pretrained.txt * Move the peft stuffs to a community folder. * Move the demo sft to community * delete dirty files * Add instructions to install peft using source * Remove Chinese comments * remove the Chinese comments	2023-04-06 11:54:52 +08:00
kingkingofall	57a3c4db6d	[chat]fix readme (#3429 ) * fix stage 2 fix stage 2 * add torch	2023-04-06 10:58:53 +08:00
Camille Zhong	72cb4dd433	[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * update roberta with coati * chat ci update * Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * [Chat] fix the tokenizer "int too big to convert" error in SFT training fix the tokenizer error during SFT training using Bloom and OPT	2023-04-06 09:30:28 +08:00
Camille Zhong	30412866e0	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * add test for reward model training * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * update roberta with coati	2023-04-03 10:11:03 +08:00
github-actions[bot]	cb413ccf28	[format] applied code formatting on changed files in pull request 3300 (#3302 ) Co-authored-by: github-actions <github-actions@github.com>	2023-03-29 09:28:24 +08:00
BlueRum	8257e1055d	[chat]polish prompts training (#3300 ) * polish train_prompts * polish readme	2023-03-29 08:44:16 +08:00
github-actions[bot]	5134ad5d1a	[format] applied code formatting on changed files in pull request 3296 (#3298 ) Co-authored-by: github-actions <github-actions@github.com>	2023-03-29 02:35:40 +08:00
BlueRum	c8b723d6c2	[chat]Update Readme (#3296 ) * Update README.md * Update README.md * Update README.md * update example readme	2023-03-29 02:32:17 +08:00
Fazzie-Maqianli	b0ce5a1032	[Coati] first commit (#3283 )	2023-03-28 20:25:36 +08:00

33 Commits (f7361ee1bd31e57004d28418133e3714b08a53b2)