Commit Graph

111 Commits (2703a37ac91beffa3e62a0e179726fa5d15d73b1)

Author SHA1 Message Date
KAIYUAN GAN 229382c844
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/cuda_util.cu code stype (#625)
3 years ago
LuGY 6a3f9fda83
[cuda] modify the fused adam, support hybrid of fp16 and fp32 (#497)
3 years ago
ExtremeViscent eaac03ae1d [formart] format fixed for kernel\cuda_native codes (#335)
3 years ago
LuGY a3269de5c9 [zero] cpu adam kernel (#288)
3 years ago
1SAA 219df6e685 Optimized MoE layer and fixed some bugs;
3 years ago
アマデウス 9ee197d0e9 moved env variables to global variables; (#215)
3 years ago
HELSON 0f8c7f9804
Fixed docstring in colossalai (#171)
3 years ago
Frank Lee e2089c5c15
adapted for sequence parallel (#163)
3 years ago
Frank Lee f3802d6b06
fixed jit default setting (#154)
3 years ago
ver217 f68eddfb3d
refactor kernel (#142)
3 years ago
shenggan 5c3843dc98
add colossalai kernel module (#55)
3 years ago