| Name | Last commit | Date |
|---|---|---|
| amp | Optimize pipeline schedule (#94) | 2021-12-30 15:56:46 +08:00 |
| builder | Optimize pipeline schedule (#94) | 2021-12-30 15:56:46 +08:00 |
| communication | add scatter/gather optim for pipeline (#123) | 2022-01-07 13:22:22 +08:00 |
| context | Added MoE parallel (#127) | 2022-01-07 15:08:36 +08:00 |
| engine | pipeline last stage supports multi output (#151) | 2022-01-17 15:57:47 +08:00 |
| kernel | refactor kernel (#142) | 2022-01-13 16:47:17 +08:00 |
| logging | update default logger (#100) (#101) | 2022-01-04 20:03:26 +08:00 |
| nn | refactor kernel (#142) | 2022-01-13 16:47:17 +08:00 |
| registry | Develop/experiments (#59) | 2021-12-09 15:08:29 +08:00 |
| trainer | Update layer integration documentations (#108) | 2022-01-10 18:05:58 +08:00 |
| utils | Added MoE parallel (#127) | 2022-01-07 15:08:36 +08:00 |
| zero | try import deepspeed when using zero (#130) | 2022-01-07 17:24:57 +08:00 |
| __init__.py | Develop/experiments (#59) | 2021-12-09 15:08:29 +08:00 |
| constants.py | Added MoE parallel (#127) | 2022-01-07 15:08:36 +08:00 |
| core.py | Develop/experiments (#59) | 2021-12-09 15:08:29 +08:00 |
| global_variables.py | Added MoE parallel (#127) | 2022-01-07 15:08:36 +08:00 |
| initialize.py | Added MoE parallel (#127) | 2022-01-07 15:08:36 +08:00 |