ColossalAI/tests/test_infer
yuehuayingxueluo b45000f839
[Inference]Add Streaming LLM (#5745)
* Add Streaming LLM

* add some parameters to llama_generation.py

* verify streamingllm config

* add test_streamingllm.py

* modified according to the opinions of review

* add Citation

* change _block_tables tolist
2024-06-05 10:51:19 +08:00
..
test_async_engine [Inference] Fix bugs and docs for feat/online-server (#5598) 2024-05-08 15:20:53 +00:00
test_kernels [release] update version (#5752) 2024-05-31 19:40:26 +08:00
test_models [Fix] Fix Inference Example, Tests, and Requirements (#5688) 2024-05-08 11:30:15 +08:00
__init__.py [Fix] Fix Inference Example, Tests, and Requirements (#5688) 2024-05-08 11:30:15 +08:00
_utils.py
test_batch_bucket.py
test_config_and_struct.py [Fix] Fix Inference Example, Tests, and Requirements (#5688) 2024-05-08 11:30:15 +08:00
test_continuous_batching.py [inference] Fix running time of test_continuous_batching (#5750) 2024-05-24 19:34:15 +08:00
test_cuda_graph.py [Fix] Fix Inference Example, Tests, and Requirements (#5688) 2024-05-08 11:30:15 +08:00
test_drafter.py [Fix] Fix Inference Example, Tests, and Requirements (#5688) 2024-05-08 11:30:15 +08:00
test_inference_engine.py [Inference] Fix bugs and docs for feat/online-server (#5598) 2024-05-08 15:20:53 +00:00
test_kvcache_manager.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00
test_request_handler.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00
test_rpc_engine.py [release] update version (#5752) 2024-05-31 19:40:26 +08:00
test_streamingllm.py [Inference]Add Streaming LLM (#5745) 2024-06-05 10:51:19 +08:00