mirror of https://github.com/hpcaitech/ColossalAI
e6d50ec107
* support existing sharded and unsharded parameters in zero
* add unit test for moe-zero model init
* polish moe gradient handler
gradient_handler
ophooks
paramhooks
schedule
__init__.py
_base_engine.py