mirror of https://github.com/InternLM/InternLM
* add local data parallel support for experts
* fix model checkpoint for local dp mode of expert
* do not set ep size from config

| Name |
|---|
| optimizer |
| __init__.py |
| beta2_scheduler.py |
| lr_scheduler.py |
| pipeline_utils.py |