Making large AI models cheaper, faster and more accessible
Latest commit: a1ce02d740 by HELSON, [zero] test gradient accumulation (#1964), 2 years ago
Name           Last commit                                                                Updated
init_ctx       [moe] fix MoE bugs (#1628)                                                 2 years ago
shard_utils    [gemini] add GeminiMemoryManger (#832)                                     3 years ago
sharded_model  [Gemini] polish memstats collector (#1962)                                 2 years ago
sharded_optim  [zero] test gradient accumulation (#1964)                                  2 years ago
sharded_param  [NFC] polish colossalai/zero/sharded_param/__init__.py code style (#1717)  2 years ago
utils          [Gemini] ZeROHookV2 -> GeminiZeROHook (#1972)                              2 years ago
__init__.py    [Gemini] add GeminiAdamOptimizer (#1960)                                   2 years ago