Making large AI models cheaper, faster and more accessible
HELSON 84fd7c1d4d
add moe context, moe utilities and refactor gradient handler (#455)
3 years ago
components_to_test                  [zero] new interface for ShardedOptimv2 (#406)  3 years ago
test_amp                            [test] fixed amp convergence comparison test (#454)  3 years ago
test_comm                           Hotfix/Colossalai layers (#92)  3 years ago
test_config                         [profiler] primary memory tracer  3 years ago
test_context                        optimized context test time consumption (#446)  3 years ago
test_data                           added CI for unit testing (#69)  3 years ago
test_data_pipeline_tensor_parallel  optimize engine and trainer test (#448)  3 years ago
test_engine                         optimize engine and trainer test (#448)  3 years ago
test_layers                         fixed padding index issue for vocab parallel embedding layers; updated 3D linear to be compatible with examples in the tutorial  3 years ago
test_moe                            add moe context, moe utilities and refactor gradient handler (#455)  3 years ago
test_optimizer                      [hotfix] run cpu adam unittest in pytest (#424)  3 years ago
test_trainer                        optimize engine and trainer test (#448)  3 years ago
test_utils                          [test] optimized zero data parallel test (#452)  3 years ago
test_zero_data_parallel             [test] fixed amp convergence comparison test (#454)  3 years ago
test_zero_tensor_parallel           Feature/zero (#279)  3 years ago
__init__.py                         [zero] Update sharded model v2 using sharded param v2 (#323)  3 years ago