ColossalAI

Jiarui Fang 3af13a2c3e [zero] polish ShardedOptimV2 unittest (#385) * place params on cpu after zero init context * polish code * bucketized cpu gpu tensor transfer * find a bug in sharded optim unittest * add offload unittest for ShardedOptimV2 * polish code and make it more robust		2022-03-11 15:50:28 +08:00
bookkeeping	Feature/zero (#279)	2022-03-11 15:50:28 +08:00
__init__.py	impl shard optim v2 and add unit test	2022-03-11 15:50:28 +08:00
_utils.py	Feature/zero (#279)	2022-03-11 15:50:28 +08:00
sharded_optim.py	[zero] cpu adam kernel (#288)	2022-03-11 15:50:28 +08:00
sharded_optim_v2.py	[zero] polish ShardedOptimV2 unittest (#385)	2022-03-11 15:50:28 +08:00