mirror of https://github.com/InternLM/InternLM
* add local data parallel support for experts * fix model checkpoint for local dp mode of expert * do not set ep size from config |
||
|---|---|---|
| .. | ||
| optimizer | ||
| __init__.py | ||
| beta2_scheduler.py | ||
| lr_scheduler.py | ||
| pipeline_utils.py | ||