InternLM/internlm/core
Wenwen Qu 375240e039
feat(moe): add local data parallel support for experts (#376)
* add local data parallel support for experts

* fix model checkpoint for local dp mode of expert

* do not set ep size from config
2023-09-28 13:38:02 +08:00
..
communication Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
context feat(moe): add local data parallel support for experts (#376) 2023-09-28 13:38:02 +08:00
scheduler feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00
engine.py initial commit 2023-07-06 12:55:23 +08:00
gradient_handler.py initial commit 2023-07-06 12:55:23 +08:00
naive_amp.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
trainer.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00