InternLM

Wenwen Qu 582ee000bd feat(moe):support zero for expert local dp (#404)	2023-10-09 17:45:26 +08:00
* support zero for expert local dp
* fix above codes:
    * treat optim.zero_world_size and optim.zero_local_rank as list in model_checkpoint.py and test_model_checkpoint.py
    * add overlap and zero check for moe in args_sanity_check(.)
legacy	feat(ckpt): fix checkpoint bugs and add feature enhancements. (#259)	2023-09-05 17:40:48 +08:00
__init__.py	feat(numa): bind numa if possible (#320)	2023-09-25 19:34:52 +08:00
initialize_tensor.py	feat(model): implement uniform_init for tensor. (#252)	2023-09-01 01:12:53 +08:00
initialize_trainer.py	docs(*): add documentation and reST files for readthedocs (#272)	2023-09-06 15:36:03 +08:00
launch.py	feat(moe):support zero for expert local dp (#404)	2023-10-09 17:45:26 +08:00