14 Commits (258b43317c4a5cafb8d3da0ff63c8843443bc448)

Author SHA1 Message Date
Frank Lee 40d376c566
[setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
Jiarui Fang db6eea3583
[builder] reconfig op_builder for pypi install (#2314) 2 years ago
Jiarui Fang 355ffb386e
[builder] unified cpu_optim fused_optim inferface (#2190) 2 years ago
Jiarui Fang 9587b080ba
[builder] use runtime builder for fused_optim (#2189) 2 years ago
Jiarui Fang bc0e271e71
[buider] use builder() for cpu adam and fused optim in setup.py (#2187) 2 years ago
Jiarui Fang d42afd30f8
[builder] runtime adam and fused_optim builder (#2184) 2 years ago
HELSON e7d3afc9cc
[optimizer] add div_scale for optimizers (#2117) 2 years ago
ver217 f8a7148dec
[kernel] move all symlinks of kernel to `colossalai._C` (#1971) 2 years ago
ver217 12b4887097
[hotfix] fix CPUAdam kernel nullptr (#1410) 2 years ago
ver217 c415240db6
[nvme] CPUAdam and HybridAdam support NVMe offload (#1360) 2 years ago
LuGY 105c5301c3
[zero]added hybrid adam, removed loss scale in adam (#527) 3 years ago
LuGY 6a3f9fda83
[cuda] modify the fused adam, support hybrid of fp16 and fp32 (#497) 3 years ago
Jiarui Fang 5d7dc3525b
[hotfix] run cpu adam unittest in pytest (#424) 3 years ago
LuGY a3269de5c9
[zero] cpu adam kernel (#288) 3 years ago