ColossalAI/tests/test_infer
Jianghai e0757c31fb
[inference] Dynamic Batching for Single and Multiple GPUs (#4831)
* finish batch manager
* 1
* first
* fix
* fix dynamic batching
* llama infer
* finish test
* support different lengths generating
* del prints
* del prints
* fix
* fix bug
---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>
2023-10-11 17:52:52 +08:00
Name                     Last commit                                                        Date
test_dynamic_batching    [inference] Dynamic Batching for Single and Multiple GPUs (#4831)  2023-10-11 17:52:52 +08:00
_utils.py                [misc] update pre-commit and run all files (#4752)                 2023-09-19 14:20:26 +08:00
test_bloom_infer.py      [infer] fix test bug (#4838)                                       2023-10-04 10:01:03 +08:00
test_chatglm2_infer.py   [infer] fix test bug (#4838)                                       2023-10-04 10:01:03 +08:00
test_infer_engine.py     [misc] update pre-commit and run all files (#4752)                 2023-09-19 14:20:26 +08:00
test_kvcache_manager.py  [misc] update pre-commit and run all files (#4752)                 2023-09-19 14:20:26 +08:00
test_llama_infer.py      [inference] Dynamic Batching for Single and Multiple GPUs (#4831)  2023-10-11 17:52:52 +08:00