You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/zero/sharded_model
Jiarui Fang 7ef3507ace
[zero] show model data cuda memory usage after zero context init. (#515)
3 years ago
..
__init__.py Feature/zero (#279) 3 years ago
_zero3_utils.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago
param_manager.py Feature/zero (#279) 3 years ago
reduce_scatter.py [zero] add sharded grad and refactor grad hooks for ShardedModel (#287) 3 years ago
sharded_grad.py [zero] add sharded grad and refactor grad hooks for ShardedModel (#287) 3 years ago
sharded_model.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago
sharded_model_v2.py [zero] show model data cuda memory usage after zero context init. (#515) 3 years ago
utils.py [zero] polish sharded param name (#484) 3 years ago