ColossalAI/colossalai/shardformer/layer
Bin Jia e241b74f24
[shardformer] Add overlap support for gpt2 (#4535)
* add overlap support for gpt2

* remove unused code

* remove unused code
2023-08-29 18:30:50 +08:00
..
__init__.py [shardformer] fix import 2023-08-15 23:25:14 +08:00
_operation.py [shardformer] Add overlap support for gpt2 (#4535) 2023-08-29 18:30:50 +08:00
dropout.py
embedding.py [shardformer] fix embedding 2023-08-15 23:25:14 +08:00
linear.py [shardformer/fix overlap bug] fix overlap bug, add overlap as an option in shardco… (#4516) 2023-08-28 17:16:40 +08:00
loss.py
normalization.py
parallel_module.py [shardformer] support sharded checkpoint IO for models of HybridParallelPlugin (#4506) 2023-08-25 22:04:57 +08:00
qkv_fused_linear.py [shardformer] Add overlap support for gpt2 (#4535) 2023-08-29 18:30:50 +08:00
utils.py