Making large AI models cheaper, faster and more accessible
Latest commit: 7c70bfbefa by ver217, [hotfix] fix PipelineSharedModuleGradientHandler (#1314), 2 years ago
File                                    Last commit                                                                   Updated
__init__.py                             fix format (#585)                                                             3 years ago
_base_gradient_handler.py               fix format (#586)                                                             3 years ago
_data_parallel_gradient_handler.py      [doc] improved docstring and assertion messages for the engine module (#871)  3 years ago
_moe_gradient_handler.py                [doc] improved docstring and assertion messages for the engine module (#871)  3 years ago
_pipeline_parallel_gradient_handler.py  [hotfix] fix PipelineSharedModuleGradientHandler (#1314)                      2 years ago
_sequence_parallel_gradient_handler.py  [doc] improved docstring and assertion messages for the engine module (#871)  3 years ago
_zero_gradient_handler.py               [doc] improved docstring and assertion messages for the engine module (#871)  3 years ago
utils.py                                add moe context, moe utilities and refactor gradient handler (#455)           3 years ago