mirror of https://github.com/InternLM/InternLM
* catch exception of all ranks * monitor task only if DO_ALERT is True |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| alert.py | ||
| monitor.py | ||
| utils.py | ||
* catch exception of all ranks * monitor task only if DO_ALERT is True |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| alert.py | ||
| monitor.py | ||
| utils.py | ||