ColossalAI/tests/test_moe
HELSON a9b8300d54
[zero] improve adaptability for not-shard parameters (#708)
* adapt post grad hooks for not-shard parameters
* adapt optimizer for not-shard parameters
* offload gradients for not-replicated parameters
2022-04-11 13:38:51 +08:00
..
test_grad_handler.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_kernel.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_moe_group.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_moe_zero_init.py [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
test_moe_zero_model.py [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
test_moe_zero_optim.py [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00