ColossalAI/colossalai/kernel
Jianghai e0757c31fb
[inference] Dynamic Batching for Single and Multiple GPUs (#4831)
* finish batch manager

* 1

* first

* fix

* fix dynamic batching

* llama infer

* finish test

* support different lengths generating

* del prints

* del prints

* fix

* fix bug

---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>
2023-10-11 17:52:52 +08:00
cuda_native  [NFC] polish code style (#4799)                                              2023-10-07 13:36:52 +08:00
jit          [misc] update pre-commit and run all files (#4752)                           2023-09-19 14:20:26 +08:00
triton       [inference] Dynamic Batching for Single and Multiple GPUs (#4831)            2023-10-11 17:52:52 +08:00
__init__.py  [hotfix] Fix import error: colossal.kernel without triton installed (#4722)  2023-09-14 18:03:55 +08:00
op_builder   [builder] reconfig op_builder for pypi install (#2314)                       2023-01-04 16:32:32 +08:00