You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/shardformer/policies
Wang Binluo 537f6a3855
[Shardformer]fix the num_heads assert for llama model and qwen model (#5704)
7 months ago
..
__init__.py
auto_policy.py [Shardformer] Support the Qwen2 model (#5699) 7 months ago
base_policy.py
bert.py
blip2.py
bloom.py
chatglm2.py
falcon.py
gpt2.py
gptj.py
llama.py [Shardformer]fix the num_heads assert for llama model and qwen model (#5704) 7 months ago
mistral.py
opt.py [pre-commit.ci] auto fixes from pre-commit.com hooks 7 months ago
qwen2.py [Shardformer]fix the num_heads assert for llama model and qwen model (#5704) 7 months ago
sam.py
t5.py
vit.py
whisper.py