ColossalAI/tests/test_infer/test_ops
Yuanheng Zhao d63c469f45 [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401)
* [Infer/Fix] Fix Dependency in test - RMSNorm kernel (#5399)

fix dependency in pytest

* resolve conflicts for revising flash-attn

* adapt kv cache copy kernel for spec-dec

* fix seqlen-n kvcache copy kernel/tests

* test kvcache copy - use torch.equal

* add assertions

* (trivial) comment out
2024-04-10 11:07:51 +08:00
..
cuda [Inference/Kernel]Add get_cos_and_sin Kernel (#5528) 2024-04-01 13:47:14 +08:00
triton [Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401) 2024-04-10 11:07:51 +08:00
__init__.py [Inference]Add CUDA KVCache Kernel (#5406) 2024-02-28 14:36:50 +08:00