Making large AI models cheaper, faster and more accessible
github-actions[bot] d10ee42f68
[format] applied code formatting on changed files in pull request 5088 (#5127)
12 months ago
..
kit [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
test_analyzer [misc] update pre-commit and run all files (#4752) 1 year ago
test_auto_parallel [misc] update pre-commit and run all files (#4752) 1 year ago
test_autochunk [misc] update pre-commit and run all files (#4752) 1 year ago
test_booster [format] applied code formatting on changed files in pull request 5088 (#5127) 12 months ago
test_checkpoint_io [gemini] gemini support extra-dp (#5043) 1 year ago
test_cluster [misc] update pre-commit and run all files (#4752) 1 year ago
test_config [misc] update pre-commit and run all files (#4752) 1 year ago
test_device [misc] update pre-commit and run all files (#4752) 1 year ago
test_fx [misc] update pre-commit and run all files (#4752) 1 year ago
test_gptq [feature] add gptq for inference (#4754) 1 year ago
test_infer [Hotfix] Fix model policy matching strategy in ShardFormer (#5064) 1 year ago
test_infer_ops/triton [inference] Refactor inference architecture (#5057) 1 year ago
test_lazy [lazy] support from_pretrained (#4801) 1 year ago
test_legacy [npu] add npu support for gemini and zero (#5067) 1 year ago
test_moe [hotfix]: modify create_ep_hierarchical_group and add test (#5032) 1 year ago
test_optimizer [test] merge old components to test to model zoo (#4945) 1 year ago
test_pipeline [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
test_shardformer [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
test_smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
test_tensor [misc] update pre-commit and run all files (#4752) 1 year ago
test_utils [misc] update pre-commit and run all files (#4752) 1 year ago
test_zero [npu] add npu support for gemini and zero (#5067) 1 year ago
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago