ColossalAI/colossalai/shardformer/policies
FoolPlayer 45927d5527 [shardformer] Add dropout layer in shard model and refactor policy api (#3949)
* add dist dropout in model

* update docstring and bert policy with dropout

* refactor basepolicy and sharded, update bert

* update format

* update gpt2 policy

* update bert policy

* remove unused code

* update readme for new policy usage
2023-07-04 16:05:01 +08:00
..
__init__.py [shardformer] init shardformer code structure (#3731) 2023-07-04 16:05:01 +08:00
autopolicy.py [shardformer] add gpt2 policy and modify shard and slicer to support (#3883) 2023-07-04 16:05:01 +08:00
basepolicy.py [shardformer] Add dropout layer in shard model and refactor policy api (#3949) 2023-07-04 16:05:01 +08:00
bert.py [shardformer] Add dropout layer in shard model and refactor policy api (#3949) 2023-07-04 16:05:01 +08:00
gpt2.py [shardformer] Add dropout layer in shard model and refactor policy api (#3949) 2023-07-04 16:05:01 +08:00