mirror of https://github.com/hpcaitech/ColossalAI
3eebc4dff7
* [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit |
||
---|---|---|
.. | ||
dataset | ||
experience_maker | ||
nn | ||
replay_buffer | ||
trainer | ||
__init__.py |