Jianghai
|
24c0768795
|
[shardformer] Pytree fix (#4533)
* pytree test
* test bert
* test bert
* test bert
* revise
* add register
* add register
|
2023-09-04 17:52:23 +08:00 |
Baizhou Zhang
|
0387a47e63
|
[shardformer] fix emerged bugs after updating transformers (#4526)
|
2023-08-29 11:25:05 +08:00 |
Baizhou Zhang
|
ed4c448488
|
[pipeline] rewrite t5 tests & support multi-tensor transmitting in pipeline (#4388)
* fix remaining t5 bugs/rewrite t5 tests
* fix multi-tensor communication in pipeline
* rearrange test_config
* fix keyerror in sync_shared_params
* fix get_held_layers & Randomnizer, complete t5 tests
* erase printing
* fix get_held_layers through modifying _release_unheld_layers
* fix _get_recursive_held_layers bug
|
2023-08-15 23:25:14 +08:00 |
Hongxin Liu
|
f51ce1bc8e
|
[pipeline] refactor 1f1b schedule (#4115)
* [api] update optimizer wrapper to fit pipeline
* [pipeline] add base schedule
* [pipeline] add 1f1b schedule
* [test] add pipeline schedule utils test
* [pipeline] fix import
|
2023-08-15 23:25:14 +08:00 |