Frank Lee | 89f45eda5a | [shardformer] added development protocol for standardization (#4149) | 2023-07-04 16:05:01 +08:00
Frank Lee | 1fb0d95df0 | [shardformer] made tensor parallelism configurable (#4144); polish code | 2023-07-04 16:05:01 +08:00
Frank Lee | 74257cb446 | [shardformer] refactored some doc and api (#4137); polish code | 2023-07-04 16:05:01 +08:00
Frank Lee | ae035d305d | [shardformer] added embedding gradient check (#4124) | 2023-07-04 16:05:01 +08:00
Frank Lee | f3b6aaa6b7 | [shardformer] supported fused normalization (#4112) | 2023-07-04 16:05:01 +08:00
Frank Lee | b1c2901530 | [shardformer] supported bloom model (#4098) | 2023-07-04 16:05:01 +08:00