ColossalAI/extensions/csrc
傅剑寒 8ccb6714e7
[Inference/Feat] Add kvcache quantization support for FlashDecoding (#5656)
2024-04-26 19:40:37 +08:00
..
common [Inference/Feat] Add kvcache quantization support for FlashDecoding (#5656) 2024-04-26 19:40:37 +08:00
funcs [Inference/Feat] Add kvcache quantization support for FlashDecoding (#5656) 2024-04-26 19:40:37 +08:00
kernel [Inference/Feat] Add kvcache quantization support for FlashDecoding (#5656) 2024-04-26 19:40:37 +08:00
__init__.py [Inference/Refactor] Delete Duplicated code and refactor vec_copy utils and reduce utils (#5593) 2024-04-15 10:57:51 +08:00