HELSON
|
ea13a201bb
|
[polish] polish code for get_static_torch_model (#2405)
* [gemini] polish code
* [testing] remove code
* [gemini] make more robust
|
2023-01-09 17:41:38 +08:00 |
Jiarui Fang
|
2e9cbfca12
|
[Gemini] add unitests to check gemini correctness (#2015)
|
2022-11-24 16:51:45 +08:00 |
Jiarui Fang
|
f7e276fa71
|
[Gemini] add GeminiAdamOptimizer (#1960)
|
2022-11-16 14:44:28 +08:00 |
Jiarui Fang
|
9f4fb3f28a
|
[ColoTensor] ColoInitContext initialize parameters in shard mode. (#1937)
|
2022-11-14 16:05:09 +08:00 |
HELSON
|
f69f9bf223
|
[zero] add chunk init function for users (#1729)
* add chunk manager init function
* fix unit tests
* add comment
* add flush=True
|
2022-10-18 16:31:22 +08:00 |
HELSON
|
b28991dd0a
|
[feature] A new ZeRO implementation (#1644)
|
2022-10-09 09:18:51 +08:00 |