ColossalAI/colossalai/zero/sharded_model
HELSON a9b8300d54
[zero] improve adaptability for not-shard parameters (#708)
* adapt post grad hooks for not-shard parameters
* adapt optimizer for not-shard parameters
* offload gradients for not-replicated parameters
2022-04-11 13:38:51 +08:00
..
__init__.py [refactor] remove old zero code (#517) 2022-03-25 14:54:39 +08:00
_utils.py [zero] add stateful tensor (#549) 2022-03-30 13:51:37 +08:00
reduce_scatter.py [zero] add sharded grad and refactor grad hooks for ShardedModel (#287) 2022-03-11 15:50:28 +08:00
sharded_model_v2.py [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
utils.py [polish] rename col_attr -> colo_attr (#558) 2022-03-31 12:25:45 +08:00