..
__init__.py
[shardformer] adapted T5 and LLaMa test to use kit ( #4049 )
2023-07-04 16:05:01 +08:00
_utils.py
[Feature] auto-cast optimizers to distributed version ( #5746 )
2024-05-24 17:24:16 +08:00
test_shard_bert.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_blip2.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_bloom.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_chatglm2.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_falcon.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_gpt2.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_gptj.py
[release] update version ( #5752 )
2024-05-31 19:40:26 +08:00
test_shard_llama.py
[zero]remove registered gradients hooks ( #5687 )
2024-05-07 12:01:38 +08:00
test_shard_mistral.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_opt.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_qwen2.py
[Shardformer] Support the Qwen2 model ( #5699 )
2024-05-09 20:04:25 +08:00
test_shard_sam.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_t5.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_vit.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00
test_shard_whisper.py
[misc] refactor launch API and tensor constructor ( #5666 )
2024-04-29 10:40:11 +08:00