BlueRum
|
3eebc4dff7
|
[chatgpt] fix rm eval (#2829)
* [chatgpt]fix train_rm bug with lora
* [chatgpt]support colossalai strategy to train rm
* fix pre-commit
* fix pre-commit 2
* [chatgpt]fix rm eval typo
* fix rm eval
* fix pre commit
|
2023-02-21 11:35:45 +08:00 |
ver217
|
4ee311c026
|
[chatgpt] startegy add prepare method (#2766)
* [chatgpt] startegy add prepare method
* [chatgpt] refactor examples
* [chatgpt] refactor strategy.prepare
* [chatgpt] support save/load checkpoint
* [chatgpt] fix unwrap actor
* [chatgpt] fix unwrap actor
|
2023-02-17 11:27:27 +08:00 |
ver217
|
a88bc828d5
|
[chatgpt] disable shard init for colossalai (#2767)
|
2023-02-16 20:09:34 +08:00 |
BlueRum
|
613efebc5c
|
[chatgpt] support colossalai strategy to train rm (#2742)
* [chatgpt]fix train_rm bug with lora
* [chatgpt]support colossalai strategy to train rm
* fix pre-commit
* fix pre-commit 2
|
2023-02-16 11:24:07 +08:00 |
BlueRum
|
648183a960
|
[chatgpt]fix train_rm bug with lora (#2741)
|
2023-02-16 10:25:17 +08:00 |
ver217
|
9c0943ecdb
|
[chatgpt] optimize generation kwargs (#2717)
* [chatgpt] ppo trainer use default generate args
* [chatgpt] example remove generation preparing fn
* [chatgpt] benchmark remove generation preparing fn
* [chatgpt] fix ci
|
2023-02-15 13:59:58 +08:00 |
ver217
|
1b34701027
|
[app] add chatgpt application (#2698)
|
2023-02-14 22:17:25 +08:00 |