ColossalAI/tests/test_infer/test_ops/cuda
yuehuayingxueluo 04aca9e55b
[Inference/Kernel]Add get_cos_and_sin Kernel (#5528)
* Add get_cos_and_sin kernel

* fix code comments

* fix code typos

* merge common codes of get_cos_and_sin kernel.

* Fixed a typo

* Changed 'asset allclose' to 'assert equal'.
2024-04-01 13:47:14 +08:00
..
__init__.py [Inference]Add CUDA KVCache Kernel (#5406) 2024-02-28 14:36:50 +08:00
test_get_cos_and_sin.py [Inference/Kernel]Add get_cos_and_sin Kernel (#5528) 2024-04-01 13:47:14 +08:00
test_kv_cache_memcpy.py [Inference]Support FP16/BF16 Flash Attention 2 And Add high_precision Flag To Rotary Embedding (#5461) 2024-03-25 13:40:34 +08:00
test_rms_layernorm.py feat rmsnorm cuda kernel and add unittest, benchmark script (#5417) 2024-03-08 16:21:12 +08:00
test_rotary_embdding_unpad.py [Inference]Support FP16/BF16 Flash Attention 2 And Add high_precision Flag To Rotary Embedding (#5461) 2024-03-25 13:40:34 +08:00
test_silu_and_mul.py add silu_and_mul for infer 2024-03-07 16:57:49 +08:00