Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
2023-09-19 14:20:26 +08:00 |
Frank Lee
|
b1c2901530
|
[shardformer] supported bloom model (#4098)
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
f22ddacef0
|
[shardformer] refactored the shardformer layer structure (#4053)
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
3893fa1a8d
|
[shardformer] refactored embedding and dropout to parallel module (#4013)
* [shardformer] refactored embedding and dropout to parallel module
* polish code
|
2023-07-04 16:05:01 +08:00 |