mirror of https://github.com/hpcaitech/ColossalAI
3af13a2c3e
* place params on cpu after zero init context * polish code * bucketzed cpu gpu tensor transter * find a bug in sharded optim unittest * add offload unittest for ShardedOptimV2. * polish code and make it more robust |
||
---|---|---|
.. | ||
bookkeeping | ||
__init__.py | ||
_utils.py | ||
sharded_optim.py | ||
sharded_optim_v2.py |