ColossalAI/applications/ChatGPT/chatgpt/trainer
pgzhang b429529365
[chatgpt] add supervised learning fine-tune code (#3183)
* [chatgpt] add supervised fine-tune code

* [chatgpt] delete unused code and modified comment code

* [chatgpt] use pytorch distributed sampler instead

---------

Co-authored-by: zhangpengpeng <zhangpengpeng@joyy.com>
2023-03-22 09:59:42 +08:00
..
callbacks [chatgpt] Add saving ckpt callback for PPO (#2880) 2023-03-07 10:13:25 +08:00
strategies [chatgpt] fix lora save bug (#3099) 2023-03-10 17:58:10 +08:00
__init__.py [chatgpt] add supervised learning fine-tune code (#3183) 2023-03-22 09:59:42 +08:00
base.py [chatgpt] making experience support dp (#2971) 2023-03-03 15:51:19 +08:00
ppo.py [chatgpt] fix trainer generate kwargs (#3166) 2023-03-17 17:31:22 +08:00
rm.py [chatgpt]Reward Model Training Process update (#3133) 2023-03-20 09:59:06 +08:00
sft.py [chatgpt] add supervised learning fine-tune code (#3183) 2023-03-22 09:59:42 +08:00
utils.py [app] add chatgpt application (#2698) 2023-02-14 22:17:25 +08:00