ColossalAI

Making large AI models cheaper, faster and more accessible

History

Yuanheng Zhao bd38fe6b91 [NFC] Fix code factors on inference triton kernels (#5743 )		6 months ago
..
__init__.py	[Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401 )	8 months ago
context_attn_unpad.py	[kernel] Support New KCache Layout - Triton Kernel (#5677 )	7 months ago
flash_decoding.py	[NFC] Fix code factors on inference triton kernels (#5743 )	6 months ago
fused_rotary_embedding.py	[Inference]Fused the gate and up proj in mlp，and optimized the autograd process. (#5365 )	10 months ago
kvcache_copy.py	[kernel] Support New KCache Layout - Triton Kernel (#5677 )	7 months ago
llama_act_combine_kernel.py	[devops] remove post commit ci (#5566 )	8 months ago
no_pad_rotary_embedding.py	[kernel] Support New KCache Layout - Triton Kernel (#5677 )	7 months ago
qkv_matmul_kernel.py	[misc] update pre-commit and run all files (#4752 )	1 year ago
rms_layernorm.py	[fix] multi graphs capture error	9 months ago
rotary_cache_copy.py	[Inference]Fused the gate and up proj in mlp，and optimized the autograd process. (#5365 )	10 months ago
softmax.py	[misc] update pre-commit and run all files (#4752 )	1 year ago