ColossalAI/tests/kit/model_zoo/transformers
flybird11111 eedaa3e1ef
[shardformer]fix gpt2 double head (#4663)
* [shardformer]fix gpt2 test

[shardformer]fix gpt2 test

[shardformer]fix gpt2 test

* fix

* [shardformer] add todo

* [shardformer] add todo
2023-09-11 18:35:03 +08:00
..
__init__.py rename chatglm to chatglm2 (#4484) 2023-08-22 14:13:31 +08:00
albert.py [gemini] improve compatibility and add static placement policy (#4479) 2023-08-24 09:29:25 +08:00
bert.py [gemini] improve compatibility and add static placement policy (#4479) 2023-08-24 09:29:25 +08:00
blip2.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
bloom.py [shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) 2023-08-15 23:25:14 +08:00
chatglm2.py [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
gpt.py [shardformer]fix gpt2 double head (#4663) 2023-09-11 18:35:03 +08:00
llama.py [shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 2023-09-09 22:45:36 +08:00
opt.py [shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 2023-09-09 22:45:36 +08:00
sam.py [shardformer] support SAM (#4231) 2023-08-15 23:25:14 +08:00
t5.py [shardformer]update t5 tests for using all optimizations. (#4407) 2023-08-15 23:25:14 +08:00
vit.py [shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) 2023-08-15 23:25:14 +08:00
whisper.py [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00