ColossalAI

Commit Graph

Author	SHA1	Message	Date
BlueRum	7548ca5a54	[chatgpt]Reward Model Training Process update (#3133) * add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh	2023-03-20 09:59:06 +08:00
github-actions[bot]	e86d9bb2e1	[format] applied code formatting on changed files in pull request 3025 (#3026) Co-authored-by: github-actions <github-actions@github.com>	2023-03-07 12:55:17 +08:00
BlueRum	55dcd3051a	[chatgpt] fix readme (#3025)	2023-03-07 10:21:25 +08:00
BlueRum	e588703454	[chatgpt]fix inference model load (#2988) * fix lora bug * polish * fix lora gemini * fix inference laod model bug	2023-03-07 09:17:52 +08:00
Fazzie-Maqianli	bbf9c827c3	[ChatGPT] fix README (#2966) * Update README.md * fix README * Update README.md * Update README.md --------- Co-authored-by: fastalgo <youyang@cs.berkeley.edu> Co-authored-by: BlueRum <70618399+ht-zhou@users.noreply.github.com>	2023-03-02 15:00:05 +08:00
binmakeswell	b0a8766381	[doc] fix chatgpt inference typo (#2964)	2023-03-02 11:22:08 +08:00
BlueRum	489a9566af	[chatgpt]add inference example (#2944) * [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py	2023-03-01 13:39:39 +08:00
ver217	1b34701027	[app] add chatgpt application (#2698)	2023-02-14 22:17:25 +08:00

8 Commits (bbac6760e59beed8be6d74f62f9589c8f7240cda)