InternLM/internlm/initialize
ytxiong c219065348
feat(*): support sequence_parallel (#180)
* support sequence_parallel for no pipeline

* sequence_parallel does not support no-flash-attn

* support sequence parallel for pipeline

* add memory profiler

* Update 13B.py

* add memory profiler

* fix evaluation bug

* remove some unnecessary code

* remove some unnecessary code

* Update parallel_context.py

* modify the config

* remove memory profiler

* modify the config

* support selective dropout
2023-08-07 16:42:52 +08:00
__init__.py             initial commit                                          2023-07-06 12:55:23 +08:00
initialize_tensor.py    initial commit                                          2023-07-06 12:55:23 +08:00
initialize_trainer.py   refactor(scheduler): rewrite pipeline scheduler (#138)  2023-08-03 11:48:12 +08:00
launch.py               feat(*): support sequence_parallel (#180)               2023-08-07 16:42:52 +08:00