ColossalAI/colossalai/zero/sharded_optim
Jiarui Fang 3af13a2c3e [zero] polish ShardedOptimV2 unittest (#385)
* place params on cpu after zero init context

* polish code

* bucketized cpu gpu tensor transfer

* find a bug in sharded optim unittest

* add offload unittest for ShardedOptimV2.

* polish code and make it more robust
2022-03-11 15:50:28 +08:00
bookkeeping Feature/zero (#279) 2022-03-11 15:50:28 +08:00
__init__.py impl shard optim v2 and add unit test 2022-03-11 15:50:28 +08:00
_utils.py Feature/zero (#279) 2022-03-11 15:50:28 +08:00
sharded_optim.py [zero] cpu adam kernel (#288) 2022-03-11 15:50:28 +08:00
sharded_optim_v2.py [zero] polish ShardedOptimV2 unittest (#385) 2022-03-11 15:50:28 +08:00