ColossalAI/extensions/pybind
Steve Luo 5cd75ce4c7
[Inference/Kernel] refactor kvcache manager and rotary_embedding and kvcache_memcpy oper… (#5663)
* refactor kvcache manager and rotary_embedding and kvcache_memcpy operator

* refactor decode_kv_cache_memcpy

* enable alibi in pagedattention
2024-04-30 15:52:23 +08:00
cpu_adam         [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
flash_attention  [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
inference        [Inference/Kernel] refactor kvcache manager and rotary_embedding and kvcache_memcpy oper… (#5663)  2024-04-30 15:52:23 +08:00
layernorm        [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
moe              [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
optimizer        [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
softmax          [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00
__init__.py      [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613)  2024-04-24 14:17:54 +08:00