ColossalAI/tests/test_moe
HELSON e6d50ec107
[zero] adapt zero for unsharded parameters (#561)
* support existing sharded and unsharded parameters in zero

* add unit test for moe-zero model init

* polish moe gradient handler
2022-03-31 18:34:11 +08:00
test_grad_handler.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_kernel.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_moe_group.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00
test_moe_zero_init.py [zero] adapt zero for unsharded parameters (#561) 2022-03-31 18:34:11 +08:00
test_moe_zero_model.py [zero] adapt zero for unsharded parameters (#561) 2022-03-31 18:34:11 +08:00