mirror of https://github.com/hpcaitech/ColossalAI
3eebc4dff7
* [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit |
||
---|---|---|
.. | ||
callbacks | ||
strategies | ||
__init__.py | ||
base.py | ||
ppo.py | ||
rm.py | ||
utils.py |