ColossalAI/tests
Wang Binluo a3cc68ca93
[Shardformer] Support the Qwen2 model (#5699)
* feat: support qwen2 model

* fix: modify model config and add Qwen2RMSNorm

* fix qwen2 model conflicts

* test: add qwen2 shard test

* to: add qwen2 auto policy

* support qwen model

* fix the conflicts

* add try catch

* add transformers version for qwen2

* add the ColoAttention for the qwen2 model

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add the unit test version check

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix the test input bug

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix the version check

* fix the version check

---------

Co-authored-by: Wenhao Chen <cwher@outlook.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-09 20:04:25 +08:00
..
kit [Shardformer] Support the Qwen2 model (#5699) 2024-05-09 20:04:25 +08:00
test_analyzer [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_auto_parallel [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_autochunk [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_booster [zero]remove registered gradients hooks (#5687) 2024-05-07 12:01:38 +08:00
test_checkpoint_io [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_cluster [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_config [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_device [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_fx [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_gptq [devops] remove post commit ci (#5566) 2024-04-08 15:09:40 +08:00
test_infer [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_lazy [lazyinit] skip whisper test (#5653) 2024-04-26 14:03:12 +08:00
test_legacy [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_lora [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_moe [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_optimizer [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_pipeline [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shardformer [Shardformer] Support the Qwen2 model (#5699) 2024-05-09 20:04:25 +08:00
test_smoothquant [inference] Add smmoothquant for llama (#4904) 2023-10-16 11:28:44 +08:00
test_tensor [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_zero [gemini]remove registered gradients hooks (#5696) 2024-05-09 10:29:49 +08:00
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 2022-03-11 15:50:28 +08:00