mirror of https://github.com/hpcaitech/ColossalAI
07b5283b6a
* add context attn unpadded triton kernel
* test compatibility
* kv cache copy (testing)
* fix k/v cache copy
* fix kv cache copy and test
* fix boundary of block ptrs
* add support for GQA/MQA and testing
* fix import statement

---------

Co-authored-by: Round Heng <yuanhengzhao@Rounds-MacBook-Pro.local>

| Name |
|---|
| cuda_native |
| jit |
| triton |
| __init__.py |
| op_builder |