ColossalAI/tests/kit/model_zoo/transformers
Wang Binluo a3cc68ca93
[Shardformer] Support the Qwen2 model (#5699)
* feat: support qwen2 model

* fix: modify model config and add Qwen2RMSNorm

* fix qwen2 model conflicts

* test: add qwen2 shard test

* to: add qwen2 auto policy

* support qwen model

* fix the conflicts

* add try catch

* add transformers version for qwen2

* add the ColoAttention for the qwen2 model

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add the unit test version check

* fix the test input bug

* fix the version check

---------

Co-authored-by: Wenhao Chen <cwher@outlook.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-09 20:04:25 +08:00
__init__.py [Shardformer] Support the Qwen2 model (#5699) 2024-05-09 20:04:25 +08:00
albert.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
bert.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
blip2.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
bloom.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
chatglm2.py [shardformer] fix chatglm implementation (#5644) 2024-04-25 14:41:17 +08:00
falcon.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
gpt.py [shardformer] Sequence Parallelism Optimization (#5533) 2024-04-03 17:15:47 +08:00
gptj.py [workflow] fixed oom tests (#5275) 2024-01-16 18:55:13 +08:00
llama.py [shardformer] update transformers (#5583) 2024-04-24 22:51:50 +08:00
mistral.py [shardformer] update transformers (#5583) 2024-04-24 22:51:50 +08:00
opt.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
qwen2.py [Shardformer] Support the Qwen2 model (#5699) 2024-05-09 20:04:25 +08:00
sam.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
t5.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
vit.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
whisper.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00