Jianghai
|
ef4c14a5e2
|
[Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014)
* fix bug
* fix
* fix multiquery
* fix multiquery
---------
Co-authored-by: CjhHa1 <cjh18671720497outlook.com>
|
2023-11-07 15:01:50 +08:00 |
littsk
|
1a3315e336
|
[hotfix] Add layer norm gradients all-reduce for sequence parallel (#4926)
* [hotfix] Add layer norm gradients all-reduce for sequence parallel. (#4915)
* Add layer norm gradients all-reduce for sequence parallel.
* skip pipeline inference test
* [hotfix] fixing polices of sequence parallel (#4922)
* Add layer norm gradients all-reduce for sequence parallel.
* fix parameter passing when calling get_autopolicy
---------
Co-authored-by: littsk <1214689160@qq.com>
* Hotfix/add grad all reduce for sequence parallel (#4927)
* Add layer norm gradients all-reduce for sequence parallel.
* fix parameter passing when calling get_autopolicy
* fix bug using wrong variables
---------
Co-authored-by: littsk <1214689160@qq.com>
* fix policy initialization
* fix bloom and chatglm policices
* polish code of handling layernorm
* fix moe module
* polish code of class initializing
---------
Co-authored-by: Zhongkai Zhao <kanezz620@gmail.com>
|
2023-11-03 13:32:43 +08:00 |
Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
2023-09-19 14:20:26 +08:00 |
Jianghai
|
24c0768795
|
[shardformer] Pytree fix (#4533)
* pytree test
* test bert
* test bert
* test bert
* revise
* add register
* add register
|
2023-09-04 17:52:23 +08:00 |
Bin Jia
|
c554b7f559
|
[shardformer/fix overlap bug] fix overlap bug, add overlap as an option in shardco… (#4516)
* fix overlap bug and support bert, add overlap as an option in shardconfig
* support overlap for chatglm and bloom
|
2023-08-28 17:16:40 +08:00 |
flybird11111
|
59e252ecdb
|
[shardformer] chatglm support sequence parallel (#4482)
* [shardformer] chatglm support sequence parallel
[shardformer] chatglm support sequence parallel
[shardformer] chatglm support sequence parallel
[shardformer] chatglm support sequence parallel
[shardformer] chatglm support sequence parallel
[shardformer] chatglm support sequence parallel
* fix
fix
fix
fix
|
2023-08-22 23:59:31 +08:00 |
Jianghai
|
5545114fd8
|
rename chatglm to chatglm2 (#4484)
|
2023-08-22 14:13:31 +08:00 |