ColossalAI/colossalai/engine
HELSON e6d50ec107
[zero] adapt zero for unsharded parameters (#561)
* support existing sharded and unsharded parameters in zero

* add unit test for moe-zero model init

* polish moe gradient handler
2022-03-31 18:34:11 +08:00
gradient_handler [zero] adapt zero for unsharded parameters (#561) 2022-03-31 18:34:11 +08:00
ophooks [zero] adapt zero for unsharded parameters (#561) 2022-03-31 18:34:11 +08:00
paramhooks Fix/format colossalai/engine/paramhooks/(#350) 2022-03-11 15:50:28 +08:00
schedule Refactored docstring to google style 2022-03-29 17:17:47 +08:00
__init__.py
_base_engine.py Refactored docstring to google style 2022-03-29 17:17:47 +08:00