You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/engine/gradient_handler
HELSON dceae85195
Added MoE parallel (#127)
3 years ago
..
__init__.py Added MoE parallel (#127) 3 years ago
_base_gradient_handler.py Migrated project 3 years ago
_data_parallel_gradient_handler.py add interleaved pipeline, fix naive amp and update pipeline model initializer (#80) 3 years ago
_moe_gradient_handler.py Added MoE parallel (#127) 3 years ago
_pipeline_parallel_gradient_handler.py Optimize pipeline schedule (#94) 3 years ago
_zero_gradient_handler.py Migrated project 3 years ago