102 Commits (feature/inference-refactor)

Author SHA1 Message Date
Xuanlei Zhao dc003c304c
[moe] merge moe into main (#4978) 1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
Hongxin Liu b5f9e37c70
[legacy] clean up legacy code (#4743) 1 year ago
Hongxin Liu 554aa9592e
[legacy] move communication and nn to legacy and refactor logger (#4671) 1 year ago
Hongxin Liu ac178ca5c1 [legacy] move builder and registry to legacy (#4603) 1 year ago
Frank Lee 015af592f8 [shardformer] integrated linear 1D with dtensor (#3996) 1 year ago
FoolPlayer ab8a47f830 [shardformer] add Dropout layer support different dropout pattern (#3856) 1 year ago
FoolPlayer 8cc11235c0 [shardformer]: Feature/shardformer, add some docstring and readme (#3816) 1 year ago
github-actions[bot] a52f62082d
[format] applied code formatting on changed files in pull request 4021 (#4022) 1 year ago
Frank Lee ddcf58cacf
Revert "[sync] sync feature/shardformer with develop" 1 year ago
FoolPlayer 21a3915c98 [shardformer] add Dropout layer support different dropout pattern (#3856) 1 year ago
FoolPlayer 58f6432416 [shardformer]: Feature/shardformer, add some docstring and readme (#3816) 1 year ago
digger yu 1878749753
[nfc] fix typo colossalai/nn (#3887) 1 year ago
digger-yu b9a8dff7e5
[doc] Fix typo under colossalai and doc(#3618) 2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424) 2 years ago
HELSON 1a1d68b053
[moe] add checkpoint for moe models (#3354) 2 years ago
zbian 61e687831d fixed using zero with tp cannot access weight correctly 2 years ago
Frank Lee 40d376c566
[setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
Jiarui Fang 16cc8e6aa7
[builder] MOE builder (#2277) 2 years ago
zbian e94c79f15b improved allgather & reducescatter for 3d 2 years ago
アマデウス 622f863291
[hotfix] Jit type hint #2161 (#2164) 2 years ago
ver217 f8a7148dec
[kernel] move all symlinks of kernel to `colossalai._C` (#1971) 2 years ago
アマデウス e52f9d9109
[tensorparallel] fixed tp layers (#1938) 2 years ago
Jiarui Fang 986f8cbaa7
[inference] overlap comm and compute in Linear1D_Row when stream_chunk_num > 1 (#1876) 2 years ago
Jiarui Fang c2947dadf1
[inference] streaming Linear 1D Row inference (#1874) 2 years ago
zbian 653b0a620e added skip_bias_add for non-tp linear 2 years ago
アマデウス 4268ae017b
[kernel] added jit warmup (#1792) 2 years ago
kurisusnowdeng 0b8161fab8 updated tp layers 2 years ago
HELSON a088022efc
[moe] fix moe bugs (#1633) 2 years ago
HELSON f7f2248771
[moe] fix MoE bugs (#1628) 2 years ago
DouJS f586887a90 [NFC] polish colossalai/nn/layer/colossalai_layer/dropout.py code style (#1568) 2 years ago
Ofey Chan 7cc052f6c0 [NFC] polish colossalai/nn/layer/colossalai_layer/linear.py (#1556) 2 years ago
ver217 10dd8226b1
add gather_output for VocabParallelClassifier1D (#1569) 2 years ago
ver217 ae71036cd2
[utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2 years ago
runluo f83c4d6597
[NFC] polish colossalai/nn/layer/wrapper/pipeline_wrapper.py code style (#1303) 2 years ago
XYE e83b2ce853 [NFC] polish colossalai/nn/layer/vanilla/layers.py code style (#1295) 2 years ago
Liping233 1000a41fd5 [NFC] polish colossalai/nn/layer/vanilla/__init__.py code style (#1293) 2 years ago
Wangbo Zhao(黑色枷锁) 552667825b [NFC] polish colossalai/nn/layer/parallel_1d/layers.py code style (#1290) 2 years ago
Jiatong Han 38e3ccd1e9 [NFC] polish colossalai/nn/layer/parallel_sequence/layers.py code style (#1280) 2 years ago
Geng Zhang 0e06f62160 [NFC] polish colossalai/nn/layer/parallel_sequence/_operation.py code style (#1266) 2 years ago
superhao1995 f660152c73 [NFC] polish colossalai/nn/layer/parallel_3d/_operation.py code style (#1258) 2 years ago
Frank Lee 2b2dc1c86b
[pipeline] refactor the pipeline module (#1087) 2 years ago
Ziyue Jiang 0653c63eaa
[Tensor] 1d row embedding (#1075) 2 years ago
Ziheng Qin 571f12eff3 [NFC] polish colossalai/nn/layer/utils/common.py code style (#983) 3 years ago
shenggan 18542b47fc [NFC] polish colossalai/nn/layer/parallel_2d/layers.py code style (#976) 3 years ago
Zirui Zhu 598cde4a0f [NFC] polish colossalai/nn/layer/parallel_2p5d/layers.py code style (#972) 3 years ago
LuGY fb5bc6cb28 [NFC] polish colossalai/nn/layer/parallel_3d/layers.py code style (#966) 3 years ago
ver217 58580b50fe
Revert "[NFC] Hotfix/format (#984)" (#986) 3 years ago
binmakeswell 0772828fba
[NFC] Hotfix/format (#984) 3 years ago
HELSON e5ea3fdeef
[gemini] add GeminiMemoryManger (#832) 3 years ago