mirror of https://github.com/hpcaitech/ColossalAI
b45000f839
* Add Streaming LLM * add some parameters to llama_generation.py * verify streamingllm config * add test_streamingllm.py * modified according to the opinions of review * add Citation * change _block_tables tolist |
||
---|---|---|
.. | ||
test_async_engine | ||
test_kernels | ||
test_models | ||
__init__.py | ||
_utils.py | ||
test_batch_bucket.py | ||
test_config_and_struct.py | ||
test_continuous_batching.py | ||
test_cuda_graph.py | ||
test_drafter.py | ||
test_inference_engine.py | ||
test_kvcache_manager.py | ||
test_request_handler.py | ||
test_rpc_engine.py | ||
test_streamingllm.py |