ColossalAI/tests/test_infer/test_ops/triton
Yuanheng Zhao d63c469f45 [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401)
* [Infer/Fix] Fix Dependency in test - RMSNorm kernel (#5399)

fix dependency in pytest

* resolve conflicts for revising flash-attn

* adapt kv cache copy kernel for spec-dec

* fix seqlen-n kvcache copy kernel/tests

* test kvcache copy - use torch.equal

* add assertions

* (trivial) comment out
2024-04-10 11:07:51 +08:00
..
__init__.py [Inference]Add CUDA KVCache Kernel (#5406) 2024-02-28 14:36:50 +08:00
kernel_utils.py [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401) 2024-04-10 11:07:51 +08:00
test_context_attn_unpad.py [Inference]Move benchmark-related code to the example directory. (#5408) 2024-02-28 16:46:03 +08:00
test_decoding_attn.py [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401) 2024-04-10 11:07:51 +08:00
test_fused_rotary_embedding.py [Inference]Move benchmark-related code to the example directory. (#5408) 2024-02-28 16:46:03 +08:00
test_kvcache_copy.py [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401) 2024-04-10 11:07:51 +08:00
test_rmsnorm_triton.py [Inference]Move benchmark-related code to the example directory. (#5408) 2024-02-28 16:46:03 +08:00
test_rotary_embdding_unpad.py [Inference]Move benchmark-related code to the example directory. (#5408) 2024-02-28 16:46:03 +08:00
test_xine_copy.py [Inference]Move benchmark-related code to the example directory. (#5408) 2024-02-28 16:46:03 +08:00