Hongxin Liu
|
7bd0bee8ea
|
[chat] add opt attn kernel (#3655)
* [chat] add opt attn kernel
* [chat] disable xformer during fwd
|
2023-05-04 16:03:33 +08:00 |
Hongxin Liu
|
2a951955ad
|
[chat] refactor trainer (#3648)
* [chat] ppo trainer remove useless args
* [chat] update examples
* [chat] update benchmark
* [chat] update examples
* [chat] fix sft training with wandb
* [chat] polish docstr
|
2023-04-26 18:11:49 +08:00 |
Hongxin Liu
|
50793b35f4
|
[gemini] accelerate inference (#3641)
* [gemini] support don't scatter after inference
* [chat] update colossalai strategy
* [chat] fix opt benchmark
* [chat] update opt benchmark
* [gemini] optimize inference
* [test] add gemini inference test
* [chat] fix unit test ci
* [chat] fix ci
* [chat] fix ci
* [chat] skip checkpoint test
|
2023-04-26 16:32:40 +08:00 |
Yuanchen
|
1ec0d386a9
|
reconstruct chat trainer and fix training script (#3588)
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
|
2023-04-18 16:44:03 +08:00 |
Fazzie-Maqianli
|
b0ce5a1032
|
[Coati] first commit (#3283)
|
2023-03-28 20:25:36 +08:00 |