ColossalAI/tests/test_shardformer/test_model
botbw 4fa6b9509c [moe] add parallel strategy for shared_expert && fix test for deepseek (#6063) 2024-09-18 10:09:01 +08:00
__init__.py [shardformer] adapted T5 and LLaMa test to use kit (#4049) 2023-07-04 16:05:01 +08:00
_utils.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_bert.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_blip2.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_bloom.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_chatglm2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_command.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_deepseek.py [moe] add parallel strategy for shared_expert && fix test for deepseek (#6063) 2024-09-18 10:09:01 +08:00
test_shard_falcon.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_gpt2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_gptj.py [release] update version (#5752) 2024-05-31 19:40:26 +08:00
test_shard_llama.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_mistral.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shard_mixtral.py [hotfix] moe hybrid parallelism benchmark & follow-up fix (#6048) 2024-09-10 17:30:53 +08:00
test_shard_opt.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shard_qwen2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_sam.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shard_t5.py [shardformer] Support the T5ForTokenClassification model (#5816) 2024-06-27 16:40:38 +08:00
test_shard_vit.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_whisper.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00