Commit Graph

1 Commits (e0757c31fb4491fef908d897846d47b030fd56f1)

Author SHA1 Message Date
Jianghai e0757c31fb
[inference] Dynamic Batching for Single and Multiple GPUs (#4831)
* finish batch manager

* 1

* first

* fix

* fix dynamic batching

* llama infer

* finish test

* support different lengths generating

* del prints

* del prints

* fix

* fix bug

---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>
2023-10-11 17:52:52 +08:00