.. |
__init__.py
|
[shardformer] adapted T5 and LLaMa test to use kit (#4049)
|
2023-07-04 16:05:01 +08:00 |
_utils.py
|
[test] add mixtral transformer test
|
2024-08-01 10:06:59 +08:00 |
test_shard_bert.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_blip2.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_bloom.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_chatglm2.py
|
[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897)
|
2024-07-10 11:34:25 +08:00 |
test_shard_command.py
|
[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897)
|
2024-07-10 11:34:25 +08:00 |
test_shard_deepseek.py
|
[chore] remove redundant test case, print string & reduce test tokens
|
2024-08-01 10:06:59 +08:00 |
test_shard_falcon.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_gpt2.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_gptj.py
|
[release] update version (#5752)
|
2024-05-31 19:40:26 +08:00 |
test_shard_llama.py
|
[Feature] Enable PP + SP for llama (#5868)
|
2024-07-09 18:05:20 +08:00 |
test_shard_mistral.py
|
[misc] refactor launch API and tensor constructor (#5666)
|
2024-04-29 10:40:11 +08:00 |
test_shard_mixtral.py
|
[misc] remove incompatible test config
|
2024-08-01 10:06:59 +08:00 |
test_shard_opt.py
|
[misc] refactor launch API and tensor constructor (#5666)
|
2024-04-29 10:40:11 +08:00 |
test_shard_qwen2.py
|
[ShardFormer] fix qwen2 sp (#5903)
|
2024-07-15 13:58:06 +08:00 |
test_shard_sam.py
|
[misc] refactor launch API and tensor constructor (#5666)
|
2024-04-29 10:40:11 +08:00 |
test_shard_t5.py
|
[shardformer] Support the T5ForTokenClassification model (#5816)
|
2024-06-27 16:40:38 +08:00 |
test_shard_vit.py
|
[CI/tests] simplify some test case to reduce testing time (#5755)
|
2024-06-04 13:57:54 +08:00 |
test_shard_whisper.py
|
[misc] refactor launch API and tensor constructor (#5666)
|
2024-04-29 10:40:11 +08:00 |