ColossalAI/extensions/csrc
傅剑寒 1ace1065e6
[Inference/Feat] Add quant kvcache support for decode_kv_cache_memcpy (#5686)
2024-05-06 15:35:13 +08:00
..
common [Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679) 2024-05-06 10:55:34 +08:00
funcs [Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679) 2024-05-06 10:55:34 +08:00
kernel [Inference/Feat] Add quant kvcache support for decode_kv_cache_memcpy (#5686) 2024-05-06 15:35:13 +08:00
__init__.py [Inference/Refactor] Delete Duplicated code and refactor vec_copy utils and reduce utils (#5593) 2024-04-15 10:57:51 +08:00