ColossalAI/colossalai/shardformer/modeling
FoolPlayer b3f5d7a3ba [shardformer] support pipeline base vit model (#4284)
* Feature/vit support (#4182)

* [shardformer] added tests

* [shardformer] vit test finish and support

* fix attention dropout

* support base vit pipeline

* support vit downstream model

* fix vit shard test

* modify hidden states return type

---------

Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com>
2023-08-15 23:25:14 +08:00
..
__init__.py [shardformer] added development protocol for standardization (#4149) 2023-07-04 16:05:01 +08:00
bert.py [pipeline] reformat for unified design (#4283) 2023-08-15 23:25:14 +08:00
bloom.py [pipeline] reformat for unified design (#4283) 2023-08-15 23:25:14 +08:00
gpt2.py [pipeline] refactor gpt2 pipeline forwards (#4287) 2023-08-15 23:25:14 +08:00
llama.py [pipeline] reformat for unified design (#4283) 2023-08-15 23:25:14 +08:00
t5.py [pipeline] add pipeline support for all T5 models (#4310) 2023-08-15 23:25:14 +08:00
vit.py [shardformer] support pipeline base vit model (#4284) 2023-08-15 23:25:14 +08:00