ColossalAI/colossalai/shardformer/modeling
Jianghai a88e92251d [pipeline] add chatglm (#4363)
* add pipeline policy and bert forward to be done

* add bertmodel pipeline forward and make tests

* add Bert_Policy and test for policy

* update formatting

* update formatting

* update the code

* fix bugs

* fix name confilt

* add bloom model and policy ,revise the base class of policy

* revise

* revision

* add bert_for_pretraining

* add bert_for_pretraining forward and policy

* fix typos

* cancel warning

* change the imediate output to default dict

* change the default output of get_shared_params

* add chatglm

* add

* chatglm

* chatglm

* finish chatglm

* deletes

* fix rmsnorm

* chatglm

* fix chatglm shard

* init
2023-08-15 23:25:14 +08:00
..
chatglm2_6b [pipeline] add chatglm (#4363) 2023-08-15 23:25:14 +08:00
__init__.py [shardformer] added development protocol for standardization (#4149) 2023-07-04 16:05:01 +08:00
bert.py [pipeline] fix return_dict/fix pure_pipeline_test (#4331) 2023-08-15 23:25:14 +08:00
blip2.py [shardformer] support Blip2 (#4243) 2023-08-15 23:25:14 +08:00
bloom.py [pipeline] fix return_dict/fix pure_pipeline_test (#4331) 2023-08-15 23:25:14 +08:00
chatglm.py [pipeline] add chatglm (#4363) 2023-08-15 23:25:14 +08:00
gpt2.py [pipeline] fix return_dict/fix pure_pipeline_test (#4331) 2023-08-15 23:25:14 +08:00
llama.py [plugin] add 3d parallel plugin (#4295) 2023-08-15 23:25:14 +08:00
sam.py [shardformer] support Blip2 (#4243) 2023-08-15 23:25:14 +08:00
t5.py [pipeline] add pipeline support for all T5 models (#4310) 2023-08-15 23:25:14 +08:00
vit.py [shardformer] support pipeline base vit model (#4284) 2023-08-15 23:25:14 +08:00