Commit Graph

33 Commits (78483a9fdd226db7516ec498acd764d44bece6c6)

Author SHA1 Message Date
Jiarui Fang 1cb532ffec
[builder] multihead attn runtime building (#2203)
2 years ago
Jiarui Fang bc0e271e71
[buider] use builder() for cpu adam and fused optim in setup.py (#2187)
2 years ago
Frank Lee 81e0da7fa8
[setup] supported conda-installed torch (#2048)
2 years ago
ver217 f8a7148dec
[kernel] move all symlinks of kernel to `colossalai._C` (#1971)
2 years ago
Boyuan Yao 1df98d5b66
[autoparallel] add rotor C version (#1658)
2 years ago
Super Daniel be229217ce
[fx] add torchaudio test (#1369)
2 years ago
ver217 1d625fcd36
[setup] support more cuda architectures (#920)
3 years ago
ver217 5d8f1262fb
update cuda ext cc flags (#919)
3 years ago
ver217 150b1a7453
update local version format (#909)
3 years ago
ver217 daf59ff72e
[setup] add local version label (#890)
3 years ago
Frank Lee 9f6f656952
[setup] use env var instead of option for cuda ext (#839)
3 years ago
Frank Lee 5e00e6cf23
[setup] allow installation with python 3.6 (#834)
3 years ago
FrankLeeeee f63e91d280 [cli] fixed a bug in user args and refactored the module structure
3 years ago
Jiarui Fang e761ad2cd7
Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806)
3 years ago
HELSON 88759e289e
[zero] add ZeroTensorShardStrategy (#793)
3 years ago
Frank Lee 05d9ae5999
[cli] add missing requirement (#805)
3 years ago
YuliangLiu0306 cfadc9df8e
[cli] added distributed launcher command (#791)
3 years ago
Frank Lee a5c3f072f6
[bug] removed zero installation requirements (#731)
3 years ago
Frank Lee f0d6e2208b
[polish] add license meta to setup.py (#427)
3 years ago
xyupeng af801cb4df fix format setup.py (#343)
3 years ago
LuGY a3269de5c9 [zero] cpu adam kernel (#288)
3 years ago
FrankLeeeee dfc3fafe89 update unit testing CI rules
3 years ago
FrankLeeeee bbbfe9b2c9 added compatibility CI and options for release ci
3 years ago
1SAA 219df6e685 Optimized MoE layer and fixed some bugs;
3 years ago
ver217 24f8583cc4 update setup info (#233)
3 years ago
ver217 578ea0583b update setup and workflow (#222)
3 years ago
ver217 f68eddfb3d
refactor kernel (#142)
3 years ago
shenggan 5c3843dc98
add colossalai kernel module (#55)
3 years ago
Frank Lee da01c234e1
Develop/experiments (#59)
3 years ago
Frank Lee 3defa32aee
Support TP-compatible Torch AMP and Update trainer API (#27)
3 years ago
ver217 9942fd5bfa
remove redundancy func in setup (#19) (#20)
3 years ago
binmakeswell 05e7069a5b fixed some typos in the documents, added blog link and paper author information in README
3 years ago
zbian 404ecbdcc6 Migrated project
3 years ago