12 Commits (ColossalChat)

Author SHA1 Message Date
Frank Lee 7cfed5f076
[feat] refactored extension module (#5298) 10 months ago
Xuanlei Zhao dd2c28a323
[npu] use extension for op builder (#5172) 11 months ago
Yuanheng Zhao e2c0e7f92a
[hotfix] Fix import error: colossal.kernel without triton installed (#4722) 1 year ago
Cuiqing Li bce0f16702
[Feature] The first PR to Add TP inference engine, kv-cache manager and related kernels for our inference system (#4577) 1 year ago
Frank Lee 40d376c566
[setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
Jiarui Fang 16cc8e6aa7
[builder] MOE builder (#2277) 2 years ago
Jiarui Fang db4cbdc7fb
[builder] builder for scaled_upper_triang_masked_softmax (#2234) 2 years ago
Jiarui Fang 1cb532ffec
[builder] multihead attn runtime building (#2203) 2 years ago
Jiarui Fang 355ffb386e
[builder] unified cpu_optim fused_optim inferface (#2190) 2 years ago
Xu Kai 2a915a8b62 fix format (#568) 3 years ago
ver217 f68eddfb3d
refactor kernel (#142) 3 years ago
shenggan 5c3843dc98
add colossalai kernel module (#55) 3 years ago