ColossalAI/colossalai/inference/kv_cache
傅剑寒 bfad39357b
[Inference/Feat] Add quant kvcache interface (#5700)
* add quant kvcache interface

* delete unused output

* complete args comments
2024-05-09 18:03:24 +08:00
..
__init__.py [Inference] Add CacheBlock and KV-Cache Manager (#5156) 2024-01-11 13:39:29 +00:00
block_cache.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00
kvcache_manager.py [Inference/Feat] Add quant kvcache interface (#5700) 2024-05-09 18:03:24 +08:00