ColossalAI/colossalai/shardformer/layer
wukong1992 c1c672d0f0 [shardformer] shardformer support t5 model (#3994)
test t5
2023-07-04 16:05:01 +08:00
..
__init__.py [shardformer] add Dropout layer support different dropout pattern (#3856) 2023-07-04 16:05:01 +08:00
_operation.py [shardformer] add Dropout layer support different dropout pattern (#3856) 2023-07-04 16:05:01 +08:00
dist_crossentropy.py [shardformer] support llama model using shardformer (#3969) 2023-07-04 16:05:01 +08:00
dropout.py [shardformer] Unit test (#3928) 2023-07-04 16:05:01 +08:00
layers.py [shardformer] shardformer support t5 model (#3994) 2023-07-04 16:05:01 +08:00