mirror of https://github.com/InternLM/InternLM
Latest commit message:

* support sequence_parallel for no pipeline
* sequence_parallel does not support no-flash-attn
* support sequence parallel for pipeline
* add memory profiler
* Update 13B.py
* add memory profiler
* fix evaluation bug
* remove some unnecessary code
* remove some unnecessary code
* Update parallel_context.py
* modify the config
* remove memory profiler
* modify the config
* support selective dropout

| Name |
|---|
| `communication` |
| `context` |
| `scheduler` |
| `__init__.py` |
| `engine.py` |
| `gradient_handler.py` |
| `naive_amp.py` |
| `trainer.py` |