ColossalAI

Commit Graph

Author	SHA1	Message	Date
Camille Zhong	8bccb72c8d	[Doc] enhancement on README.md for chat examples (#3646 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * Update README.md Update README.md * update readme * Update test_ci.sh	2023-04-27 14:26:19 +08:00
Hongxin Liu	2a951955ad	[chat] refactor trainer (#3648 ) * [chat] ppo trainer remove useless args * [chat] update examples * [chat] update benchmark * [chat] update examples * [chat] fix sft training with wandb * [chat] polish docstr	2023-04-26 18:11:49 +08:00
Hongxin Liu	f8288315d9	[chat] polish performance evaluator (#3647 )	2023-04-26 17:34:59 +08:00
Hongxin Liu	50793b35f4	[gemini] accelerate inference (#3641 ) * [gemini] support don't scatter after inference * [chat] update colossalai strategy * [chat] fix opt benchmark * [chat] update opt benchmark * [gemini] optimize inference * [test] add gemini inference test * [chat] fix unit test ci * [chat] fix ci * [chat] fix ci * [chat] skip checkpoint test	2023-04-26 16:32:40 +08:00
Tong Li	e1b0a78afa	Merge pull request #3621 from zhang-yi-chi/fix/chat-train-prompts-single-gpu [chat] fix single gpu training bug in examples/train_prompts.py	2023-04-24 22:13:54 +08:00
ddobokki	df309fc6ab	[Chat] Remove duplicate functions (#3625 )	2023-04-24 12:23:15 +08:00
zhang-yi-chi	739cfe3360	[chat] fix enable single gpu training bug	2023-04-22 14:16:08 +08:00
digger-yu	d7bf284706	[chat] polish code note typo (#3612 )	2023-04-20 17:22:15 +08:00
Yuanchen	c4709d34cf	Chat evaluate (#3608 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-20 11:12:24 +08:00
binmakeswell	5a79cffdfd	[coati] fix install cmd (#3592 )	2023-04-18 18:19:48 +08:00
Yuanchen	1ec0d386a9	reconstruct chat trainer and fix training script (#3588 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-18 16:44:03 +08:00
Camille Zhong	36a519b49f	Update test_ci.sh update Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh update Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml update ci Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update test_ci.sh Update test_ci.sh Update test_ci.sh update test ci RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. [test]chat_update_ci Update test_ci.sh Update test_ci.sh test Update gpt_critic.py Update gpt_critic.py Update run_chatgpt_unit_tests.yml update test ci update update update update Update test_ci.sh update Update test_ci.sh Update test_ci.sh Update run_chatgpt_examples.yml Update run_chatgpt_examples.yml	2023-04-18 14:33:12 +08:00
tingfeng cao	7788e0b0a5	fix: fix sft (#3568 )	2023-04-17 16:47:44 +08:00
Fazzie-Maqianli	6b1a39b17b	[coati] add costom model suppor tguide (#3579 )	2023-04-17 15:40:41 +08:00
binmakeswell	cc1eec2f53	[chat] update reward model sh (#3578 )	2023-04-17 15:02:55 +08:00
csric	e355144375	[chatgpt] Detached PPO Training (#3195 ) * run the base * working on dist ppo * sync * detached trainer * update detached trainer. no maker update function * facing init problem * 1 maker 1 trainer detached run. but no model update * facing cuda problem * fix save functions * verified maker update * nothing * add ignore * analyize loss issue * remove some debug codes * facing 2m1t stuck issue * 2m1t verified * do not use torchrun * working on 2m2t * working on 2m2t * initialize strategy in ray actor env * facing actor's init order issue * facing ddp model update issue (need unwarp ddp) * unwrap ddp actor * checking 1m2t stuck problem * nothing * set timeout for trainer choosing. It solves the stuck problem! * delete some debug output * rename to sync with upstream * rename to sync with upstream * coati rename * nothing * I am going to detach the replaybuffer from trainer and make it a Ray Actor. Two benefits: 1. support TP trainer. 2. asynchronized buffer operations * experience_maker_holder performs target-revolving _send_experience() instead of length comparison. * move code to ray subfolder * working on pipeline inference * apply comments --------- Co-authored-by: csric <richcsr256@gmail.com>	2023-04-17 14:46:50 +08:00
MisterLin1995	1a809eddaa	[chat] ChatGPT train prompts on ray example (#3309 ) * [feat][chatgpt]train prompts on ray example * [fix]simplify code * [fix]remove depreciated parameter * [fix]add dependencies * [fix]method calling * [fix]experience maker * [fix]missing loss function * [fix]init optimizer * [feat]add usage comment * [fix]rename files * [fix]add readme * [fix]file path * [fix]move directory --------- Co-authored-by: jiangwen <zxl265370@antgroup.com>	2023-04-13 18:18:36 +08:00
binmakeswell	535b896435	[chat] polish tutorial doc (#3551 ) * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial * [chat] clean up duplicate tutorial	2023-04-13 18:11:48 +08:00
Yuanchen	7182ac2a04	[chat]add examples of training with limited resources in chat readme (#3536 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-12 15:47:09 +08:00
zhang-yi-chi	e6a132a449	[chat]: add vf_coef argument for PPOTrainer (#3318 )	2023-04-11 09:54:59 +08:00
ver217	89fd10a1c9	[chat] add zero2 cpu strategy for sft training (#3520 )	2023-04-10 19:00:13 +08:00
binmakeswell	990d4c3e4e	[doc] hide diffusion in application path (#3519 ) - [ ] Stable Diffusion - [ ] Dreambooth It's easy for users to think that we don't support them yet. Add them after migrating them from example to application https://github.com/hpcaitech/ColossalAI/tree/main/examples/images	2023-04-10 17:52:24 +08:00
binmakeswell	0c0455700f	[doc] add requirement and highlight application (#3516 ) * [doc] add requirement and highlight application * [doc] link example and application	2023-04-10 17:37:16 +08:00
NatalieC323	635d0a1baf	[Chat Community] Update README.md (fixed#3487) (#3506 ) * Update README.md * Update README.md * Update README.md * Update README.md --------- Co-authored-by: Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>	2023-04-10 14:36:39 +08:00
gongenlei	a7ca297281	[coati] Fix LlamaCritic (#3475 ) * mv LlamaForCausalLM to LlamaModel * rm unused imports --------- Co-authored-by: gongenlei <gongenlei@baidu.com>	2023-04-07 11:39:09 +08:00
binmakeswell	891b8e7fac	[chat] fix stage3 PPO sample sh command (#3477 )	2023-04-06 18:08:16 +08:00
Fazzie-Maqianli	6afeb1202a	add community example dictionary (#3465 )	2023-04-06 15:04:48 +08:00
Frank Lee	80eba05b0a	[test] refactor tests with spawn (#3452 ) * [test] added spawn decorator * polish code * polish code * polish code * polish code * polish code * polish code	2023-04-06 14:51:35 +08:00
YY Lin	62f4e2eb07	[Chat]Add Peft support & fix the ptx bug (#3433 ) * Update ppo.py Fix the bug of fetching wrong batch data * Add peft model support in SFT and Prompts training In stage-1 and stage-3, the peft model supports are added. So the trained artifacts will be only a small lora additions instead of the whole bunch of files. * Delete test_prompts.txt * Delete test_pretrained.txt * Move the peft stuffs to a community folder. * Move the demo sft to community * delete dirty files * Add instructions to install peft using source * Remove Chinese comments * remove the Chinese comments	2023-04-06 11:54:52 +08:00
Dr-Corgi	73afb63594	[chat]fix save_model(#3377 ) The function save_model should be a part of PPOTrainer.	2023-04-06 11:19:14 +08:00
kingkingofall	57a3c4db6d	[chat]fix readme (#3429 ) * fix stage 2 fix stage 2 * add torch	2023-04-06 10:58:53 +08:00
Camille Zhong	72cb4dd433	[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * update roberta with coati * chat ci update * Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * [Chat] fix the tokenizer "int too big to convert" error in SFT training fix the tokenizer error during SFT training using Bloom and OPT	2023-04-06 09:30:28 +08:00
Yuanchen	b92313903f	fix save_model indent error in ppo trainer (#3450 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-05 09:45:42 +08:00
Yuanchen	773955abfa	fix save_model inin naive and ddp strategy (#3436 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-04 15:30:01 +08:00
ver217	26b7aac0be	[zero] reorganize zero/gemini folder structure (#3424 ) * [zero] refactor low-level zero folder structure * [zero] fix legacy zero import path * [zero] fix legacy zero import path * [zero] remove useless import * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] fix test import path * [zero] fix test * [zero] fix circular import * [zero] update import	2023-04-04 13:48:16 +08:00
Yuanchen	b09adff724	[chat]fix sft training for bloom, gpt and opt (#3418 ) fix sft training for bloom, gpt and opt	2023-04-04 09:46:23 +08:00
Camille Zhong	30412866e0	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 ) * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * add test for reward model training * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit `06741d894d`. * Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci * Update test_ci.sh * Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. * update roberta with coati	2023-04-03 10:11:03 +08:00
Andrew	82132f4e3d	[chat] correcting a few obvious typos and grammars errors (#3338 )	2023-03-30 14:18:37 +08:00
Fazzie-Maqianli	0fbadce79c	[doc] added authors to the chat application (#3307 )	2023-03-29 11:04:30 +08:00
BlueRum	b512893637	Polish readme link (#3306 )	2023-03-29 10:25:50 +08:00
github-actions[bot]	cb413ccf28	[format] applied code formatting on changed files in pull request 3300 (#3302 ) Co-authored-by: github-actions <github-actions@github.com>	2023-03-29 09:28:24 +08:00
binmakeswell	31c78f2be3	[doc] add ColossalChat news (#3304 ) * [doc] add ColossalChat news * [doc] add ColossalChat news	2023-03-29 09:27:55 +08:00
Frank Lee	e235a24673	[application] updated the README (#3301 ) * [application] updated the README * polish code	2023-03-29 08:47:00 +08:00
BlueRum	8257e1055d	[chat]polish prompts training (#3300 ) * polish train_prompts * polish readme	2023-03-29 08:44:16 +08:00
ver217	62f7156131	[coati] fix inference profanity check (#3299 )	2023-03-29 04:26:35 +08:00
github-actions[bot]	5134ad5d1a	[format] applied code formatting on changed files in pull request 3296 (#3298 ) Co-authored-by: github-actions <github-actions@github.com>	2023-03-29 02:35:40 +08:00
BlueRum	c8b723d6c2	[chat]Update Readme (#3296 ) * Update README.md * Update README.md * Update README.md * update example readme	2023-03-29 02:32:17 +08:00
ver217	73b542a124	[coati] inference supports profanity check (#3295 )	2023-03-29 02:14:35 +08:00
ver217	ce2cafae76	[coati] add repetition_penalty for inference (#3294 )	2023-03-29 01:18:45 +08:00
Fazzie-Maqianli	a88ed0f83a	add limit (#3293 )	2023-03-29 00:53:23 +08:00

106 Commits (8bccb72c8d6b4ff21d3d596f0188c6280d8b29f6)