mirror of https://github.com/hpcaitech/ColossalAI
a9b8300d54
* adapt post grad hooks for not-shard parameters * adapt optimizer for not-shard parameters * offload gradients for not-replicated parameters |
||
---|---|---|
.. | ||
__init__.py | ||
_utils.py | ||
sharded_optim_v2.py |