ColossalAI/tests/test_shardformer/test_model
Jianghai 8739aa7fa0
[shardformer] Pipeline/whisper (#4456)
* add some base tests and policies

* finish whisper base model

* add conditional generation

* finish basic tests

* whisper

* finish whisper

* finish whisper

* del useless whisper test

* fix

* add argmin to replace

* finish revision
2023-08-18 21:29:25 +08:00
__init__.py [shardformer] adapted T5 and LLaMa test to use kit (#4049) 2023-07-04 16:05:01 +08:00
_utils.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_bert.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_blip2.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
test_shard_bloom.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_chatglm.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_gpt2.py [shardformer/sequence parallel] support gpt2 seq parallel with pp/dp/tp (#4460) 2023-08-18 11:21:53 +08:00
test_shard_llama.py [shardformer] Pipeline/whisper (#4456) 2023-08-18 21:29:25 +08:00
test_shard_opt.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_sam.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
test_shard_t5.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_vit.py [shardformer] support DDP in HybridPlugin/add tp+dp tests (#4446) 2023-08-16 16:11:57 +08:00
test_shard_whisper.py [shardformer] Pipeline/whisper (#4456) 2023-08-18 21:29:25 +08:00