Frank Lee
|
015af592f8
|
[shardformer] integrated linear 1D with dtensor (#3996)
* [shardformer] integrated linear 1D with dtensor
* polish code
|
2023-07-04 16:05:01 +08:00 |
github-actions[bot]
|
a52f62082d
|
[format] applied code formatting on changed files in pull request 4021 (#4022)
Co-authored-by: github-actions <github-actions@github.com>
|
2023-06-19 11:23:24 +08:00 |
ver217
|
ae71036cd2
|
[utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548)
* refactor parallel layer
* broadcast rank0 model after load ckpt
|
2022-09-06 20:18:35 +08:00 |
アマデウス
|
cd13b63832
|
[model checkpoint] reworked unified layers for ease of save/load states (#593)
|
2022-04-01 16:49:56 +08:00 |
zbian
|
404ecbdcc6
|
Migrated project
|
2021-10-28 18:21:23 +02:00 |