mirror of https://github.com/hpcaitech/ColossalAI
![]() * place params on cpu after zero init context * polish code * bucketzed cpu gpu tensor transter * find a bug in sharded optim unittest * add offload unittest for ShardedOptimV2. * polish code and make it more robust |
||
---|---|---|
.. | ||
init_ctx | ||
shard_utils | ||
sharded_model | ||
sharded_optim | ||
sharded_param | ||
__init__.py |