ColossalAI/colossalai/shardformer/shard
FoolPlayer f7774ec0f3 [Shardformer] Downstream bert (#3979)
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage

* add downstream model of bert

* remove unused code
2023-07-04 16:05:01 +08:00
..
__init__.py [shardformer] refactored the user api (#3828) 2023-07-04 16:05:01 +08:00
shard_config.py [Shardformer] Downstream bert (#3979) 2023-07-04 16:05:01 +08:00
sharder.py [Shardformer] Downstream bert (#3979) 2023-07-04 16:05:01 +08:00
slicer.py [shardformer] shardformer support t5 model (#3994) 2023-07-04 16:05:01 +08:00