ColossalAI/tests/test_shardformer/test_model
flybird11111 eedaa3e1ef
[shardformer]fix gpt2 double head (#4663)
* [shardformer]fix gpt2 test

[shardformer]fix gpt2 test

[shardformer]fix gpt2 test

* fix

* [shardformer] add todo

* [shardformer] add todo
2023-09-11 18:35:03 +08:00
..
__init__.py [shardformer] adapted T5 and LLaMa test to use kit (#4049) 2023-07-04 16:05:01 +08:00
_utils.py [shardformer]fix gpt2 double head (#4663) 2023-09-11 18:35:03 +08:00
test_shard_bert.py [shardformer] Pytree fix (#4533) 2023-09-04 17:52:23 +08:00
test_shard_blip2.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
test_shard_bloom.py [shardformer] support pp+tp+zero1 tests (#4531) 2023-08-30 21:29:18 +08:00
test_shard_chatglm2.py [shardformer] fix submodule replacement bug when enabling pp (#4544) 2023-08-31 09:57:18 +08:00
test_shard_gpt2.py [shardformer]fix gpt2 double head (#4663) 2023-09-11 18:35:03 +08:00
test_shard_llama.py [shardformer] support pp+tp+zero1 tests (#4531) 2023-08-30 21:29:18 +08:00
test_shard_opt.py [shardformer] fix submodule replacement bug when enabling pp (#4544) 2023-08-31 09:57:18 +08:00
test_shard_sam.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
test_shard_t5.py [shardformer] support pp+tp+zero1 tests (#4531) 2023-08-30 21:29:18 +08:00
test_shard_vit.py [shardformer] support pp+tp+zero1 tests (#4531) 2023-08-30 21:29:18 +08:00
test_shard_whisper.py [shardformer] support pp+tp+zero1 tests (#4531) 2023-08-30 21:29:18 +08:00