ColossalAI/colossalai/shardformer/layer
Baizhou Zhang 208ac8f2ba [pipeline] Add Pipeline Forward for GPT2Model Shardformer (#4224)
* fix typehint & docstring in sharder.py
* update pipeline forward for GPT2Model
* add test for pipeline forward of GPT2Model
* add cache cleaning in gpt2 test
* change assert to raise command
2023-08-15 23:25:14 +08:00
__init__.py [shardformer] supported fused normalization (#4112) 2023-07-04 16:05:01 +08:00
_operation.py [format] applied code formatting on changed files in pull request 4152 (#4157) 2023-07-04 16:07:47 +08:00
dropout.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00
embedding.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
linear.py [pipeline] Add Pipeline Forward for GPT2Model Shardformer (#4224) 2023-08-15 23:25:14 +08:00
loss.py fix some typo colossalai/shardformer (#4160) 2023-07-04 17:53:39 +08:00
normalization.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
parallel_module.py [shardformer] supported fused qkv checkpoint (#4073) 2023-07-04 16:05:01 +08:00
qkv_fused_linear.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
utils.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00