You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/shardformer/policies
flybird1111 906426cb44
[Shardformer] Merge flash attention branch to pipeline branch (#4362)
1 year ago
..
__init__.py [shardformer] init shardformer code structure (#3731) 1 year ago
auto_policy.py [shardformer] support Blip2 (#4243) 1 year ago
base_policy.py [shardformer] fix base policy (#4229) 1 year ago
bert.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
blip2.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
bloom.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
chatglm.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
gpt2.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
llama.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
opt.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
sam.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
t5.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
vit.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
whisper.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago