Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
HELSON 84fd7c1d4d
add moe context, moe utilities and refactor gradient handler (#455)
3 years ago
..
__init__.py moved env variables to global variables; (#215) 3 years ago
_base_gradient_handler.py Migrated project 3 years ago
_data_parallel_gradient_handler.py add moe context, moe utilities and refactor gradient handler (#455) 3 years ago
_moe_gradient_handler.py add moe context, moe utilities and refactor gradient handler (#455) 3 years ago
_pipeline_parallel_gradient_handler.py Optimize pipeline schedule (#94) 3 years ago
_sequence_parallel_gradient_handler.py add moe context, moe utilities and refactor gradient handler (#455) 3 years ago
_zero_gradient_handler.py Flake8 code restyle 3 years ago
utils.py add moe context, moe utilities and refactor gradient handler (#455) 3 years ago