ColossalAI/extensions/csrc/funcs
傅剑寒 9df016fc45
[Inference] Fix quant bits order (#5681)
2024-04-30 19:38:00 +08:00
binary_functor.h [Inference/Feat] Add kvcache quant support for fused_rotary_embedding_cache_copy (#5680) 2024-04-30 18:33:53 +08:00
cast_functor.h [Inference] Fix quant bits order (#5681) 2024-04-30 19:38:00 +08:00
reduce_function.h [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613) 2024-04-24 14:17:54 +08:00
ternary_functor.h [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613) 2024-04-24 14:17:54 +08:00
unary_functor.h [Inference/Feat] Add kvcache quantization support for FlashDecoding (#5656) 2024-04-26 19:40:37 +08:00