ColossalAI

Commit Graph

Author	SHA1	Message	Date
ver217	4905b21b94	[coati] fix inference output (#3285 ) * [coati] fix inference requirements * [coati] add output postprocess * [coati] update inference readme * [coati] fix inference requirements	2 years ago
Fazzie-Maqianli	bb6196e71a	remove chatgpt (#3284 )	2 years ago
Fazzie-Maqianli	b0ce5a1032	[Coati] first commit (#3283 )	2 years ago
binmakeswell	d32ef94ad9	[doc] fix typo (#3222 ) * [doc] fix typo * [doc] fix typo	2 years ago
ver217	78fd31f9c1	[chatgpt] add precision option for colossalai (#3233 )	2 years ago
Fazzie-Maqianli	bd39877da4	support instrcut training (#3230 )	2 years ago
Camille Zhong	9bc702ab48	[doc] update chatgpt doc paper link (#3229 ) #issue 3189	2 years ago
Fazzie-Maqianli	bbac6760e5	fix torch version (#3225 )	2 years ago
Fazzie-Maqianli	fa97a9cab4	[chatgpt] unnify datasets (#3218 )	2 years ago
Fazzie-Maqianli	4fd4bd9d9a	[chatgpt] support instuct training (#3216 )	2 years ago
Yuanchen	9998d5ef64	[chatgpt]add reward model code for deberta (#3199 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2 years ago
Fazzie-Maqianli	1e1b9d2fea	[chatgpt]support llama (#3070 )	2 years ago
pgzhang	b429529365	[chatgpt] add supervised learning fine-tune code (#3183 ) * [chatgpt] add supervised fine-tune code * [chatgpt] delete unused code and modified comment code * [chatgpt] use pytorch distributed sampler instead --------- Co-authored-by: zhangpengpeng <zhangpengpeng@joyy.com>	2 years ago
BlueRum	7548ca5a54	[chatgpt]Reward Model Training Process update (#3133 ) * add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh	2 years ago
ver217	1e58d31bb7	[chatgpt] fix trainer generate kwargs (#3166 )	2 years ago
ver217	c474fda282	[chatgpt] fix ppo training hanging problem with gemini (#3162 ) * [chatgpt] fix generation early stopping * [chatgpt] fix train prompts example	2 years ago
binmakeswell	3c01280a56	[doc] add community contribution guide (#3153 ) * [doc] update contribution guide * [doc] update contribution guide * [doc] add community contribution guide	2 years ago
BlueRum	23cd5e2ccf	[chatgpt]update ci (#3087 ) * [chatgpt]update ci * Update test_ci.sh * Update test_ci.sh * Update test_ci.sh * test * Update train_prompts.py * Update train_dummy.py * add save_path * polish * add save path * polish * add save path * polish * delete bloom-560m test delete bloom-560m test because of oom * add ddp test	2 years ago
BlueRum	68577fbc43	[chatgpt]Fix examples (#3116 ) * fix train_dummy * fix train-prompts	2 years ago
BlueRum	0672b5afac	[chatgpt] fix lora support for gpt (#3113 ) * fix gpt-actor * fix gpt-critic * fix opt-critic	2 years ago
hiko2MSP	191daf7411	[chatgpt] type miss of kwargs (#3107 )	2 years ago
BlueRum	c9dd036592	[chatgpt] fix lora save bug (#3099 ) * fix colo-stratergy * polish * fix lora * fix ddp * polish * polish	2 years ago
Fazzie-Maqianli	02ae80bf9c	[chatgpt]add flag of action mask in critic(#3086 )	2 years ago
wenjunyang	b51bfec357	[chatgpt] change critic input as state (#3042 ) * fix Critic * fix Critic * fix Critic * fix neglect of attention mask * fix neglect of attention mask * fix neglect of attention mask * add return --------- Co-authored-by: yangwenjun <yangwenjun@soyoung.com> Co-authored-by: yangwjd <yangwjd@chanjet.com>	2 years ago
Fazzie-Maqianli	c21b11edce	change nn to models (#3032 )	2 years ago
github-actions[bot]	e86d9bb2e1	[format] applied code formatting on changed files in pull request 3025 (#3026 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
BlueRum	55dcd3051a	[chatgpt] fix readme (#3025 )	2 years ago
LuGY	287d60499e	[chatgpt] Add saving ckpt callback for PPO (#2880 ) * add checkpoint callback for chatgpt * add save ckpt callbacks for ppo --------- Co-authored-by: Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>	2 years ago
BlueRum	e588703454	[chatgpt]fix inference model load (#2988 ) * fix lora bug * polish * fix lora gemini * fix inference laod model bug	2 years ago
ver217	0ff8406b00	[chatgpt] allow shard init and display warning (#2986 )	2 years ago
BlueRum	f5ca0397dd	[chatgpt] fix lora gemini conflict in RM training (#2984 ) * fix lora bug * polish * fix lora gemini	2 years ago
ver217	19ad49fb3b	[chatgpt] making experience support dp (#2971 ) * [chatgpt] making experience support dp * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update sampler * [chatgpt] update example test ci * [chatgpt] refactor sampler * [chatgpt] update example test ci	2 years ago
BlueRum	c9e27f0d1b	[chatgpt]fix lora bug (#2974 ) * fix lora bug * polish	2 years ago
BlueRum	82149e9d1b	[chatgpt] fix inference demo loading bug (#2969 ) * [chatgpt] fix inference demo loading bug * polish	2 years ago
Fazzie-Maqianli	bbf9c827c3	[ChatGPT] fix README (#2966 ) * Update README.md * fix README * Update README.md * Update README.md --------- Co-authored-by: fastalgo <youyang@cs.berkeley.edu> Co-authored-by: BlueRum <70618399+ht-zhou@users.noreply.github.com>	2 years ago
binmakeswell	b0a8766381	[doc] fix chatgpt inference typo (#2964 )	2 years ago
BlueRum	489a9566af	[chatgpt]add inference example (#2944 ) * [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py	2 years ago
binmakeswell	8264cd7ef1	[doc] add env scope (#2933 )	2 years ago
BlueRum	2e16f842a9	[chatgpt]support opt & gpt for rm training (#2876 )	2 years ago
BlueRum	34ca324b0d	[chatgpt] Support saving ckpt in examples (#2846 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit * add support of saving ckpt in examples * fix single-gpu save	2 years ago
BlueRum	3eebc4dff7	[chatgpt] fix rm eval (#2829 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit	2 years ago
ver217	b6a108cb91	[chatgpt] add test checkpoint (#2797 ) * [chatgpt] add test checkpoint * [chatgpt] test checkpoint use smaller model	2 years ago
ver217	a619a190df	[chatgpt] update readme about checkpoint (#2792 ) * [chatgpt] add save/load checkpoint sample code * [chatgpt] add save/load checkpoint readme * [chatgpt] refactor save/load checkpoint readme	2 years ago
ver217	4ee311c026	[chatgpt] startegy add prepare method (#2766 ) * [chatgpt] startegy add prepare method * [chatgpt] refactor examples * [chatgpt] refactor strategy.prepare * [chatgpt] support save/load checkpoint * [chatgpt] fix unwrap actor * [chatgpt] fix unwrap actor	2 years ago
ver217	a88bc828d5	[chatgpt] disable shard init for colossalai (#2767 )	2 years ago
BlueRum	613efebc5c	[chatgpt] support colossalai strategy to train rm (#2742 ) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2	2 years ago
BlueRum	648183a960	[chatgpt]fix train_rm bug with lora (#2741 )	2 years ago
CH.Li	7aacfad8af	fix typo (#2721 )	2 years ago
ver217	9c0943ecdb	[chatgpt] optimize generation kwargs (#2717 ) * [chatgpt] ppo trainer use default generate args * [chatgpt] example remove generation preparing fn * [chatgpt] benchmark remove generation preparing fn * [chatgpt] fix ci	2 years ago
binmakeswell	d4d3387f45	[doc] add open-source contribution invitation (#2714 ) * [doc] fix typo * [doc] add invitation	2 years ago
binmakeswell	94f000515b	[doc] add Quick Preview (#2706 )	2 years ago
binmakeswell	8408c852a6	[app] fix ChatGPT requirements (#2704 )	2 years ago
ver217	1b34701027	[app] add chatgpt application (#2698 )	2 years ago

... 2 3 4 5 6

253 Commits (feat/online-serving)