mirror of https://github.com/hpcaitech/ColossalAI
![]() * adapt post grad hooks for not-shard parameters * adapt optimizer for not-shard parameters * offload gradients for not-replicated parameters |
||
---|---|---|
.. | ||
test_grad_handler.py | ||
test_kernel.py | ||
test_moe_group.py | ||
test_moe_zero_init.py | ||
test_moe_zero_model.py | ||
test_moe_zero_optim.py |