ColossalAI/tests/test_infer/test_ops/triton
yuehuayingxueluo 2a718c8be8
Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390)
* opt_view_and_memcopy

* fix bugs in ci

* fix ci bugs

* update benchmark scripts

* fix ci bugs
2024-02-21 13:23:57 +08:00
..
kernel_utils.py [inference] moved ops tests to test_infer (#5354) 2024-02-02 13:51:22 +08:00
test_context_attn_unpad.py Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390) 2024-02-21 13:23:57 +08:00
test_decoding_attn.py [inference] moved ops tests to test_infer (#5354) 2024-02-02 13:51:22 +08:00
test_fused_rotary_embedding.py [inference] moved ops tests to test_infer (#5354) 2024-02-02 13:51:22 +08:00
test_kvcache_copy.py [Inference/opt] Fused KVCahce Memcopy (#5374) 2024-02-07 17:15:42 +08:00
test_rmsnorm_triton.py Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390) 2024-02-21 13:23:57 +08:00
test_rotary_embdding_unpad.py [Inference]Fused kv copy into rotary calculation (#5383) 2024-02-21 11:31:48 +08:00
test_xine_copy.py [inference] moved ops tests to test_infer (#5354) 2024-02-02 13:51:22 +08:00