Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
YuliangLiu0306 8221fd7485
[autoparallel] update binary elementwise handler (#2451)
2 years ago
..
components_to_test [testing] add beit model for unit testings (#2196) 2 years ago
test_amp [amp] add gradient clipping for unit tests (#2283) 2 years ago
test_auto_parallel [autoparallel] update binary elementwise handler (#2451) 2 years ago
test_autochunk update doc 2 years ago
test_comm [communication] add p2p_v2.py to support communication with List[Any] (#1407) 2 years ago
test_config [pipeline] refactor the pipeline module (#1087) 2 years ago
test_context [test] refactored with the new rerun decorator (#763) 3 years ago
test_data [unittest] refactored unit tests for change in dependency (#838) 3 years ago
test_data_pipeline_tensor_parallel [engin/schedule] use p2p_v2 to recontruct pipeline_schedule (#1408) 2 years ago
test_ddp [zero] add chunk init function for users (#1729) 2 years ago
test_device [device] find best logical mesh 2 years ago
test_engine [hotfix] remove potiential circle import (#1307) 2 years ago
test_fx [Pipeline] Add Topo Class (#2059) 2 years ago
test_gemini [zero] fix state_dict and load_state_dict for ddp ignored parameters (#2443) 2 years ago
test_layers improved allgather & reducescatter for 3d 2 years ago
test_moe [test] align model name with the file name. (#2045) 2 years ago
test_ops [FAW] export FAW in _ops (#1438) 2 years ago
test_optimizer [setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
test_pipeline [PP Middleware] Add bwd and step for PP middleware (#2111) 2 years ago
test_tensor [polish] polish code for get_static_torch_model (#2405) 2 years ago
test_trainer [pipeline] refactor the pipeline module (#1087) 2 years ago
test_utils updated attention kernel (#2133) 2 years ago
test_zero [testing] add beit model for unit testings (#2196) 2 years ago
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago