mirror of https://github.com/hpcaitech/ColossalAI
fix decoding kernel pytest; revise and add triton context attn benchmark | ||
---|---|---|
__init__.py | ||
context_attn_unpad.py | ||
custom_autotune.py | ||
flash_decoding.py | ||
flash_decoding_utils.py | ||
gptq_triton.py | ||
kvcache_copy.py | ||
llama_act_combine_kernel.py | ||
no_pad_rotary_embedding.py | ||
qkv_matmul_kernel.py | ||
rms_layernorm.py | ||
softmax.py |