ColossalAI/tests/test_infer_ops/triton
Jianghai c647e00e3c
[Inference]Add fused rotary kernel and get cos cache kernel (#5302)
* add fused rotary and get cos cache func

* staged

* fix bugs

* fix bugs
2024-01-24 16:20:42 +08:00
..
kernel_utils.py [kernel/fix] Performance Optimization for Decoding Kernel and Benchmarking (#5274) 2024-01-19 15:47:16 +08:00
test_context_attn_unpad.py [Kernel/Fix] Revise flash attention triton kernel API and add benchmark (#5301) 2024-01-23 17:16:02 +08:00
test_decoding_attn.py [Kernel/Fix] Revise flash attention triton kernel API and add benchmark (#5301) 2024-01-23 17:16:02 +08:00
test_fused_rotary_embedding.py [Inference]Add fused rotary kernel and get cos cache kernel (#5302) 2024-01-24 16:20:42 +08:00
test_kvcache_copy.py [kernel] Revise KVCache copy triton kernel API (#5273) 2024-01-16 14:41:02 +08:00
test_llama_act_combine.py [moe] merge moe into main (#4978) 2023-11-02 02:21:24 +00:00
test_rmsnorm_triton.py [kernel] Add RMSLayerNorm triton kernel (#5262) 2024-01-18 10:21:03 +08:00
test_rotary_embdding_unpad.py [Inference] Benchmarking rotary embedding and add a fetch function (#5277) 2024-01-23 12:11:53 +08:00
test_softmax.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_xine_copy.py [Inference]Add fused rotary kernel and get cos cache kernel (#5302) 2024-01-24 16:20:42 +08:00