ColossalAI/colossalai/inference/core
傅剑寒 e6496dd371
[Inference] Optimize request handler of llama (#5512)
* optimize request_handler

* fix ways of writing
2024-03-26 16:37:14 +08:00
..
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00
engine.py [fix] merge conflicts 2024-03-25 14:48:28 +08:00
request_handler.py [Inference] Optimize request handler of llama (#5512) 2024-03-26 16:37:14 +08:00