Baizhou Zhang
083d7da33d
[pipeline] add pipeline support for all T5 models ( #4310 )
...
* complete policy for T5Model & T5ForConditionalGeneration
* modify function signature in forwards
* add forward for T5model
* add forward for T5ForConditionalGeneration
* fix a bug
* fix hidden_states transporting in decoder
* fix the passing of encoder_outputs
2023-08-15 23:25:14 +08:00
Baizhou Zhang
36e546b2cc
[pipeline] add pipeline support for T5Stack/T5EncoderModel ( #4300 )
...
* modify t5 policy & add test
* pipeline stage distribution for t5
* complete t5 base policy
* t5 stack: halfway
* modify gpt2 pipeline test
* complete pipeline forward for T5Stack/T5EncoderModel
* fix docstring
* move t5 util tests to test_pipeline
2023-08-15 23:25:14 +08:00
Jianghai
18ebcf406a
[pipeline] reformat for unified design ( #4283 )
...
* bert_reformat
* reformat
* reformat
* fix a typo
* format
* format
* fix bug
2023-08-15 23:25:14 +08:00
Baizhou Zhang
b774d5ea0f
[pipeline] refactor gpt2 pipeline forwards ( #4287 )
...
* move gpt2 pipeline forwards to modeling folder
* check pipeline status when adding replacing policy
* fix typehint
* fix arguments processing in gpt2_model_forward
2023-08-15 23:25:14 +08:00
Frank Lee
89f45eda5a
[shardformer] added development protocol for standardization ( #4149 )
2023-07-04 16:05:01 +08:00