Making large AI models cheaper, faster and more accessible
 
HELSON b28991dd0a
[feature] A new ZeRO implementation (#1644)
2 years ago
..
common.py [moe] fix MoE bugs (#1628) 2 years ago
test_found_inf.py Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806) 3 years ago
test_init_context.py Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806) 3 years ago
test_mem_collector.py Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806) 3 years ago
test_shard_model_v2.py Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806) 3 years ago
test_shard_param.py [gemini] add GeminiMemoryManger (#832) 3 years ago
test_sharded_optim_state_dict.py [colotensor] add Tensor.view op and its unit test (#1343) 2 years ago
test_sharded_optim_v2.py
test_sharded_optim_with_sync_bn.py
test_state_dict.py [hotfix] shared model returns cpu state_dict (#1328) 2 years ago
test_tensor_utils.py [test] ignore 8 gpu test (#1080) 2 years ago
test_zero_engine.py