ColossalAI/colossalai/shardformer/policies
flybird11111 59e252ecdb
[shardformer] chatglm support sequence parallel (#4482)
* [shardformer] chatglm support sequence parallel

[shardformer] chatglm support sequence parallel

[shardformer] chatglm support sequence parallel

[shardformer] chatglm support sequence parallel

[shardformer] chatglm support sequence parallel

[shardformer] chatglm support sequence parallel

* fix

fix

fix

fix
2023-08-22 23:59:31 +08:00
..
__init__.py [shardformer] init shardformer code structure (#3731) 2023-07-04 16:05:01 +08:00
auto_policy.py rename chatglm to chatglm2 (#4484) 2023-08-22 14:13:31 +08:00
base_policy.py [shardformer/sequence parallel] Cherry pick commit to new branch (#4450) 2023-08-16 15:41:20 +08:00
bert.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
blip2.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
bloom.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
chatglm2.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
gpt2.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
llama.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
opt.py [shardformer/sequence parallel] not support opt of seq-parallel, add warning and fix a bug in gpt2 pp (#4488) 2023-08-22 17:35:35 +08:00
sam.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
t5.py [shardformer] Pipeline/whisper (#4456) 2023-08-18 21:29:25 +08:00
vit.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
whisper.py [shardformer] Pipeline/whisper (#4456) 2023-08-18 21:29:25 +08:00