ColossalAI/tests/test_shardformer/test_model
Edenzzzz 43995ee436
[Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694)
7 months ago
__init__.py [shardformer] adapted T5 and LLaMa test to use kit (#4049) 1 year ago
_utils.py [Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694) 7 months ago
test_shard_bert.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_blip2.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_bloom.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_chatglm2.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_falcon.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_gpt2.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_gptj.py [shardformer] update colo attention to support custom mask (#5510) 8 months ago
test_shard_llama.py [zero]remove registered gradients hooks (#5687) 7 months ago
test_shard_mistral.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_opt.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_qwen2.py [Shardformer] Support the Qwen2 model (#5699) 7 months ago
test_shard_sam.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_t5.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_vit.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago
test_shard_whisper.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago