ColossalAI/tests/test_infer/test_ops
yuehuayingxueluo 12f10d5b0b
[Fix/Inference]Fix CUDA Rotary Rmbedding GQA (#5623)
* fix rotary embedding GQA

* change test_rotary_embdding_unpad.py KH
2024-04-23 13:44:49 +08:00
..
cuda [Fix/Inference]Fix CUDA Rotary Rmbedding GQA (#5623) 2024-04-23 13:44:49 +08:00
triton [Inference/Kernel] Add Paged Decoding kernel, sequence split within the same thread block (#5531) 2024-04-18 16:45:07 +08:00
__init__.py [Inference]Add CUDA KVCache Kernel (#5406) 2024-02-28 14:36:50 +08:00