ColossalAI/colossalai/shardformer/policies
FoolPlayer f1cb5ac6bf [shardformer] Align bert value (#3907)
* add bert align test, fix dist loss bug

* forward and backward align

* add ignore index

* add shardformer CI

* add gather_output optional for user in shardconfig

* update readme with optional gather_output

* add dist crossentropy loss test, remove unused files

* remove unused file

* remove unused file

* rename the file

* polish code
2023-07-04 16:05:01 +08:00
__init__.py [shardformer] init shardformer code structure (#3731) 2023-07-04 16:05:01 +08:00
autopolicy.py [shardformer] add gpt2 policy and modify shard and slicer to support (#3883) 2023-07-04 16:05:01 +08:00
basepolicy.py [shardformer] add gpt2 policy and modify shard and slicer to support (#3883) 2023-07-04 16:05:01 +08:00
bert.py [shardformer] Align bert value (#3907) 2023-07-04 16:05:01 +08:00
gpt2.py [shardformer] add gpt2 policy and modify shard and slicer to support (#3883) 2023-07-04 16:05:01 +08:00