ColossalAI

Commit Graph

Author	SHA1	Message	Date
Fazzie-Maqianli	c21b11edce	change nn to models (#3032)	2023-03-07 16:34:22 +08:00
LuGY	287d60499e	[chatgpt] Add saving ckpt callback for PPO (#2880) * add checkpoint callback for chatgpt * add save ckpt callbacks for ppo --------- Co-authored-by: Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>	2023-03-07 10:13:25 +08:00
ver217	0ff8406b00	[chatgpt] allow shard init and display warning (#2986)	2023-03-03 16:27:59 +08:00
BlueRum	f5ca0397dd	[chatgpt] fix lora gemini conflict in RM training (#2984) * fix lora bug * polish * fix lora gemini	2023-03-03 15:58:16 +08:00
ver217	19ad49fb3b	[chatgpt] making experience support dp (#2971) * [chatgpt] making experience support dp * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update sampler * [chatgpt] update example test ci * [chatgpt] refactor sampler * [chatgpt] update example test ci	2023-03-03 15:51:19 +08:00
BlueRum	c9e27f0d1b	[chatgpt]fix lora bug (#2974) * fix lora bug * polish	2023-03-02 17:51:44 +08:00
BlueRum	2e16f842a9	[chatgpt]support opt & gpt for rm training (#2876)	2023-02-22 16:58:11 +08:00
BlueRum	3eebc4dff7	[chatgpt] fix rm eval (#2829) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit	2023-02-21 11:35:45 +08:00
ver217	4ee311c026	[chatgpt] startegy add prepare method (#2766) * [chatgpt] startegy add prepare method * [chatgpt] refactor examples * [chatgpt] refactor strategy.prepare * [chatgpt] support save/load checkpoint * [chatgpt] fix unwrap actor * [chatgpt] fix unwrap actor	2023-02-17 11:27:27 +08:00
ver217	a88bc828d5	[chatgpt] disable shard init for colossalai (#2767)	2023-02-16 20:09:34 +08:00
BlueRum	613efebc5c	[chatgpt] support colossalai strategy to train rm (#2742) * [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2	2023-02-16 11:24:07 +08:00
BlueRum	648183a960	[chatgpt]fix train_rm bug with lora (#2741)	2023-02-16 10:25:17 +08:00
ver217	9c0943ecdb	[chatgpt] optimize generation kwargs (#2717) * [chatgpt] ppo trainer use default generate args * [chatgpt] example remove generation preparing fn * [chatgpt] benchmark remove generation preparing fn * [chatgpt] fix ci	2023-02-15 13:59:58 +08:00
ver217	1b34701027	[app] add chatgpt application (#2698)	2023-02-14 22:17:25 +08:00

14 Commits (c21b11edce3b772cdbcb4e5fafe95f62ac49af94)