ColossalAI/colossalai/shardformer/policies
Frank Lee d857f3dbba [shardformer] supported T5 and its variants (#4045) 2023-07-04 16:05:01 +08:00
..
__init__.py [shardformer] init shardformer code structure (#3731) 2023-07-04 16:05:01 +08:00
autopolicy.py [shardformer] supported T5 and its variants (#4045) 2023-07-04 16:05:01 +08:00
basepolicy.py [shardformer] supported T5 and its variants (#4045) 2023-07-04 16:05:01 +08:00
bert.py [shardformer] fix bert and gpt downstream with new api (#4024) 2023-07-04 16:05:01 +08:00
gpt2.py [shardformer] Add dropout layer in shard model and refactor policy api (#3949) 2023-07-04 16:05:01 +08:00
llama.py [shardformer] adapted llama to the new API (#4036) 2023-07-04 16:05:01 +08:00
t5.py [shardformer] supported T5 and its variants (#4045) 2023-07-04 16:05:01 +08:00