40 Commits (8ecff0cb7fc764782ce0adec293c372f83e590bd)

Author SHA1 Message Date
duanjunwen e0c68ab6d3
[Zerobubble] merge main. (#6142) 3 days ago
wangbluo fd92789af2 fix 1 month ago
wangbluo 6be9862aaf fix 1 month ago
wangbluo 3dc08c8a5a fix 1 month ago
wangbluo 8ff7d0c780 fix 1 month ago
wangbluo 3201377e94 fix 1 month ago
wangbluo 23199e34cc fix 1 month ago
wangbluo 703bb5c18d fix the test 1 month ago
wangbluo 4e0e99bb6a fix the test 1 month ago
Hongxin Liu dc2cdaf3e8
[shardformer] optimize seq parallelism (#6086) 1 month ago
Hongxin Liu 646b3c5a90
[shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084) 1 month ago
Wang Binluo eea37da6fa
[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016) 3 months ago
Edenzzzz f5c84af0b0
[Feature] Zigzag Ring attention (#5905) 3 months ago
Hongxin Liu 7f8b16635b
[misc] refactor launch API and tensor constructor (#5666) 7 months ago
flybird11111 a0ad587c24
[shardformer] refactor embedding resize (#5603) 7 months ago
Zhongkai Zhao 8e412a548e
[shardformer] Sequence Parallelism Optimization (#5533) 8 months ago
Insu Jang 00525f7772
[shardformer] fix pipeline forward error if custom layer distribution is used (#5189) 8 months ago
flybird11111 29695cf70c
[example]add gpt2 benchmark example script. (#5295) 9 months ago
flybird11111 79718fae04
[shardformer] llama support DistCrossEntropy (#5176) 12 months ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
Bin Jia e241b74f24
[shardformer] Add overlap support for gpt2 (#4535) 1 year ago
Bin Jia c554b7f559
[shardformer/fix overlap bug] fix overlap bug, add overlap as an option in shardco… (#4516) 1 year ago
Bin Jia 424629fea0
[shardformer/sequence parallel] Cherry pick commit to new branch (#4450) 1 year ago
FoolPlayer 726541afe2 update some module with new api version 1 year ago
FoolPlayer dd2bf02679 [shardformer] support SAM (#4231) 1 year ago
Hongxin Liu d921ce8391 [shardformer] support inplace sharding (#4251) 1 year ago
Hongxin Liu 890774b2fb [shardformer] support lazy init (#4202) 1 year ago
Jianghai f3bcc292c8 [pipeline] move bert related pipeline components to shardformer (#4187) 1 year ago
github-actions[bot] c77b3b19be
[format] applied code formatting on changed files in pull request 4152 (#4157) 1 year ago
Frank Lee b1c2901530 [shardformer] supported bloom model (#4098) 1 year ago
Kun Lin 8af29ee47a [shardformer] support vision transformer (#4096) 1 year ago
Frank Lee d33a44e8c3 [shardformer] refactored layernorm (#4086) 1 year ago
FoolPlayer 92f6791095 [shardformer] Add layernorm (#4072) 1 year ago
Frank Lee 70c58cfd4f [shardformer] supported fused qkv checkpoint (#4073) 1 year ago
FoolPlayer 0803a61412 [shardformer] add linearconv1d test (#4067) 1 year ago
Frank Lee 8eb09a4c69 [shardformer] support module saving and loading (#4062) 1 year ago
Frank Lee f22ddacef0 [shardformer] refactored the shardformer layer structure (#4053) 1 year ago
FoolPlayer 507c0ad368 add vocabembedding layer 1 year ago
Frank Lee 3893fa1a8d [shardformer] refactored embedding and dropout to parallel module (#4013) 1 year ago
Frank Lee 015af592f8 [shardformer] integrated linear 1D with dtensor (#3996) 1 year ago