ColossalAI

History

Cuiqing Li 459a88c806 [Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention (#4965 ) * adding flash-decoding * clean * adding kernel * adding flash-decoding * add integration * add * adding kernel * adding kernel * adding triton 2.1.0 features for inference * update bloom triton kernel * remove useless vllm kernels * clean codes * fix * adding files * fix readme * update llama flash-decoding --------- Co-authored-by: cuiqing.li <lixx336@gmail.com>		2023-10-30 14:04:37 +08:00
..
cuda_native	[kernel] support pure fp16 for cpu adam and update gemini optim tests (#4921 )	2023-10-16 21:56:53 +08:00
jit	[misc] update pre-commit and run all files (#4752 )	2023-09-19 14:20:26 +08:00
triton	[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention (#4965 )	2023-10-30 14:04:37 +08:00
__init__.py	[hotfix] Fix import error: colossal.kernel without triton installed (#4722 )	2023-09-14 18:03:55 +08:00
op_builder	[builder] reconfig op_builder for pypi install (#2314 )	2023-01-04 16:32:32 +08:00