ColossalAI/colossalai/inference/core
Yuanheng Zhao 912e24b2aa [SpecDec] Fix inputs for speculation and revise past KV trimming (#5449)
* fix drafter pastkv and usage of batch bucket
2024-04-10 11:07:52 +08:00
File                Last commit                                                               Date
__init__.py         [doc] updated inference readme (#5343)                                    2024-02-02 14:31:10 +08:00
engine.py           [SpecDec] Fix inputs for speculation and revise past KV trimming (#5449)  2024-04-10 11:07:52 +08:00
request_handler.py  [SpecDec] Fix inputs for speculation and revise past KV trimming (#5449)  2024-04-10 11:07:52 +08:00