InternLM/internlm/initialize
Wenwen Qu 582ee000bd
feat(moe):support zero for expert local dp (#404)
* support zero for expert local dp

* fix the code above:
    * treat optim.zero_world_size and optim.zero_local_rank as lists in model_checkpoint.py and test_model_checkpoint.py
    * add overlap and zero checks for moe in args_sanity_check(.)
2023-10-09 17:45:26 +08:00
legacy | feat(ckpt): fix checkpoint bugs and add feature enhancements. (#259) | 2023-09-05 17:40:48 +08:00
__init__.py | feat(numa): bind numa if possible (#320) | 2023-09-25 19:34:52 +08:00
initialize_tensor.py | feat(model): implement uniform_init for tensor. (#252) | 2023-09-01 01:12:53 +08:00
initialize_trainer.py | docs(*): add documentation and reST files for readthedocs (#272) | 2023-09-06 15:36:03 +08:00
launch.py | feat(moe):support zero for expert local dp (#404) | 2023-10-09 17:45:26 +08:00