You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/tests/test_shardformer/test_model
haze188 2d73efdfdd
[bugfix] colo attn bug fix
4 months ago
..
__init__.py
_utils.py [test] add mixtral transformer test 4 months ago
test_shard_bert.py
test_shard_blip2.py
test_shard_bloom.py
test_shard_chatglm2.py [ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897) 5 months ago
test_shard_command.py [ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897) 5 months ago
test_shard_deepseek.py [test] add check 4 months ago
test_shard_deepseek_ghz.py [bugfix] colo attn bug fix 4 months ago
test_shard_falcon.py
test_shard_gpt2.py
test_shard_gptj.py
test_shard_llama.py [Feature] Enable PP + SP for llama (#5868) 5 months ago
test_shard_mistral.py
test_shard_mixtral.py [test] fix test: test_zero1_2 4 months ago
test_shard_opt.py
test_shard_qwen2.py [ShardFormer] fix qwen2 sp (#5903) 5 months ago
test_shard_sam.py
test_shard_t5.py [shardformer] Support the T5ForTokenClassification model (#5816) 5 months ago
test_shard_vit.py
test_shard_whisper.py