.. |
amp
|
[hotfix] fix initialize bug with zero (#442)
|
2022-03-17 13:16:22 +08:00 |
builder
|
add pytorch hooks (#179)
|
2022-01-25 22:20:54 +08:00 |
communication
|
fix format (#332)
|
2022-03-11 15:50:28 +08:00 |
context
|
add moe context, moe utilities and refactor gradient handler (#455)
|
2022-03-18 16:38:32 +08:00 |
engine
|
add moe context, moe utilities and refactor gradient handler (#455)
|
2022-03-18 16:38:32 +08:00 |
kernel
|
[formart] format fixed for kernel\cuda_native codes (#335)
|
2022-03-11 15:50:28 +08:00 |
logging
|
[log] better logging display with rich (#426)
|
2022-03-16 09:51:15 +08:00 |
nn
|
[MOE] changed parallelmode to dist process group (#460)
|
2022-03-19 13:46:29 +08:00 |
registry
|
add pytorch hooks (#179)
|
2022-01-25 22:20:54 +08:00 |
testing
|
optimized context test time consumption (#446)
|
2022-03-17 14:40:52 +08:00 |
trainer
|
Added profiler communication operations
|
2022-03-11 15:50:28 +08:00 |
utils
|
add moe context, moe utilities and refactor gradient handler (#455)
|
2022-03-18 16:38:32 +08:00 |
zero
|
[doc] Update docstring for ZeRO (#459)
|
2022-03-18 16:48:20 +08:00 |
__init__.py
|
Develop/experiments (#59)
|
2021-12-09 15:08:29 +08:00 |
constants.py
|
fix format constants.py (#358)
|
2022-03-11 15:50:28 +08:00 |
core.py
|
add moe context, moe utilities and refactor gradient handler (#455)
|
2022-03-18 16:38:32 +08:00 |
global_variables.py
|
Optimized MoE layer and fixed some bugs;
|
2022-03-11 15:50:28 +08:00 |
initialize.py
|
[zero] Update initialize for ZeRO (#458)
|
2022-03-18 16:18:31 +08:00 |