Commit Graph

10 Commits (214da761d4df0461fa49bd23c501d661bbaa8436)

Author SHA1 Message Date
LuGY 6a3f9fda83
[cuda] modify the fused adam, support hybrid of fp16 and fp32 (#497)
3 years ago
ExtremeViscent eaac03ae1d [formart] format fixed for kernel\cuda_native codes (#335)
3 years ago
LuGY a3269de5c9 [zero] cpu adam kernel (#288)
3 years ago
1SAA 219df6e685 Optimized MoE layer and fixed some bugs;
3 years ago
アマデウス 9ee197d0e9 moved env variables to global variables; (#215)
3 years ago
HELSON 0f8c7f9804
Fixed docstring in colossalai (#171)
3 years ago
Frank Lee e2089c5c15
adapted for sequence parallel (#163)
3 years ago
Frank Lee f3802d6b06
fixed jit default setting (#154)
3 years ago
ver217 f68eddfb3d
refactor kernel (#142)
3 years ago
shenggan 5c3843dc98
add colossalai kernel module (#55)
3 years ago