Hongxin Liu
|
4965c0dabd
|
[lazy] support from_pretrained (#4801)
* [lazy] patch from pretrained
* [lazy] fix from pretrained and add tests
* [devops] update ci
|
2023-09-26 11:04:11 +08:00 |
Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
2023-09-19 14:20:26 +08:00 |
Hongxin Liu
|
807e01a4ba
|
[zero] hotfix master param sync (#4618)
* [zero] add method to update master params
* [zero] update zero plugin
* [plugin] update low level zero plugin
|
2023-09-05 15:04:02 +08:00 |
Hongxin Liu
|
f51ce1bc8e
|
[pipeline] refactor 1f1b schedule (#4115)
* [api] update optimizer wrapper to fit pipeline
* [pipeline] add base schedule
* [pipeline] add 1f1b schedule
* [test] add pipeline schedule utils test
* [pipeline] fix import
|
2023-08-15 23:25:14 +08:00 |
Baizhou Zhang
|
58913441a1
|
Next commit [checkpointio] Unsharded Optimizer Checkpoint for Gemini Plugin (#4141)
* [checkpointio] unsharded optimizer checkpoint for Gemini plugin
* [checkpointio] unsharded optimizer checkpoint for Gemini using all_gather
|
2023-07-07 16:33:06 +08:00 |
Frank Lee
|
73d3e4d309
|
[booster] implemented the torch ddd + resnet example (#3232)
* [booster] implemented the torch ddd + resnet example
* polish code
|
2023-03-27 10:24:14 +08:00 |