235 Commits (457a0de79fd2d3602eba0ac78e606acb6401fc60)

Author SHA1 Message Date
littsk 11f1e426fe
[hotfix] Correct several erroneous code comments (#4794) 1 year ago
Jianghai ce7ade3882
[inference] chatglm2 infer demo (#4724) 1 year ago
Xu Kai 946ab56c48
[feature] add gptq for inference (#4754) 1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
Baizhou Zhang f911d5b09d
[doc] Add user document for Shardformer (#4702) 1 year ago
flybird11111 20190b49a5
[shardformer] to fix whisper test failed due to significant accuracy differences. (#4710) 1 year ago
flybird11111 c7d6975d29
[shardformer] fix GPT2DoubleHeadsModel (#4703) 1 year ago
flybird11111 8844691f4b
[shardformer] update shardformer readme (#4689) 1 year ago
Cuiqing Li bce0f16702
[Feature] The first PR to Add TP inference engine, kv-cache manager and related kernels for our inference system (#4577) 1 year ago
flybird11111 eedaa3e1ef
[shardformer]fix gpt2 double head (#4663) 1 year ago
flybird11111 7486ed7d3a
[shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 1 year ago
Baizhou Zhang 295b38fecf
[example] update vit example for hybrid parallel plugin (#4641) 1 year ago
eric8607242 c3d5fa3bac
[shardformer] Support customized policy for llamav2 based model with HybridParallelPlugin (#4624) 1 year ago
flybird11111 ec0866804c
[shardformer] update shardformer readme (#4617) 1 year ago
Bin Jia 86d22581e4
[shardformer] Add overlap optional for HybridParallelPlugin (#4615) 1 year ago
Jianghai 24c0768795
[shardformer] Pytree fix (#4533) 1 year ago
Baizhou Zhang 2c787d7f47
[shardformer] fix submodule replacement bug when enabling pp (#4544) 1 year ago
flybird11111 d367b88785
[shardformer] fix opt test hanging (#4521) 1 year ago
Bin Jia e241b74f24
[shardformer] Add overlap support for gpt2 (#4535) 1 year ago
Bin Jia c554b7f559
[shardformer/fix overlap bug] fix overlap bug, add overlap as an option in shardco… (#4516) 1 year ago
Baizhou Zhang 44eab2b27f
[shardformer] support sharded checkpoint IO for models of HybridParallelPlugin (#4506) 1 year ago
flybird11111 de8a65babc
[shardformer] opt fix. (#4514) 1 year ago
flybird11111 3353e55c80
[shardformer] vit/llama/t5 ignore the sequence parallelism flag and some fix. (#4498) 1 year ago
flybird11111 59e252ecdb
[shardformer] chatglm support sequence parallel (#4482) 1 year ago
Bin Jia 351351a36e
[shardformer/sequence parallel] not support opt of seq-parallel, add warning and fix a bug in gpt2 pp (#4488) 1 year ago
Jianghai 5545114fd8
rename chatglm to chatglm2 (#4484) 1 year ago
Jianghai 8739aa7fa0
[shardformer] Pipeline/whisper (#4456) 1 year ago
flybird11111 a27e0bb494
[shardformer] bert support sequence parallel. (#4455) 1 year ago
flybird11111 0ecd71e041
[shardformer] bloom support sequence parallel (#4465) 1 year ago
Bin Jia 7c8be77081
[shardformer/sequence parallel] support gpt2 seq parallel with pp/dp/tp (#4460) 1 year ago
Bin Jia 424629fea0
[shardformer/sequence parallel] Cherry pick commit to new branch (#4450) 1 year ago
github-actions[bot] d20dceb9a3
[format] applied code formatting on changed files in pull request 4441 (#4445) 1 year ago
ver217 5d4efdf58f [shardformer] fix import 1 year ago
ver217 73a4144b91 [shardformer] fix embedding 1 year ago
Hongxin Liu 172f7fa3cf [misc] resolve code factor issues (#4433) 1 year ago
flybird11111 108e54a0b4 [shardformer]update t5 tests for using all optimizations. (#4407) 1 year ago
flybird11111 1edc9b5fb3 [shardformer] update tests for all optimization (#4413) 1 year ago
Baizhou Zhang 7711bd524a [shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) 1 year ago
flybird1111 7a3dfd0c64 [shardformer] update shardformer to use flash attention 2 (#4392) 1 year ago
Baizhou Zhang ed4c448488 [pipeline] rewrite t5 tests & support multi-tensor transmitting in pipeline (#4388) 1 year ago
flybird1111 906426cb44 [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
Jianghai a88e92251d [pipeline] add chatglm (#4363) 1 year ago
FoolPlayer 726541afe2 update some module with new api version 1 year ago
FoolPlayer 879301d0da [shardformer] support Blip2 (#4243) 1 year ago
klhhhhh 8120eca0c0 [shardformer] support ChatGLMForConditionalGeneration & add fusedlayernorm for vit 1 year ago
klhhhhh 91850fe984 [shardformer] register without auto policy 1 year ago
klhhhhh 1a29e8fc29 [shardformer] polish chatglm code 1 year ago
klhhhhh 8620009dd7 [sharformer] add first version of policy of chatglm 1 year ago
Kun Lin ed34bb1310 Feature/chatglm (#4240) 1 year ago
FoolPlayer 9ee4ebea83 [shardformer] support whisper (#4212) 1 year ago