Making large AI models cheaper, faster and more accessible
Latest commit: Yuanheng Zhao 916459c99a [inference] Add model forward accuracy test (#5102) 1 year ago
kit [Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
test_analyzer [misc] update pre-commit and run all files (#4752) 1 year ago
test_auto_parallel [misc] update pre-commit and run all files (#4752) 1 year ago
test_autochunk [misc] update pre-commit and run all files (#4752) 1 year ago
test_booster [npu] add npu support for gemini and zero (#5067) 1 year ago
test_checkpoint_io [gemini] gemini support extra-dp (#5043) 1 year ago
test_cluster [misc] update pre-commit and run all files (#4752) 1 year ago
test_config [misc] update pre-commit and run all files (#4752) 1 year ago
test_device [misc] update pre-commit and run all files (#4752) 1 year ago
test_fx [misc] update pre-commit and run all files (#4752) 1 year ago
test_gptq [feature] add gptq for inference (#4754) 1 year ago
test_infer [inference] Add model forward accuracy test (#5102) 1 year ago
test_infer_ops/triton [inference] Refactor inference architecture (#5057) 1 year ago
test_lazy [lazy] support from_pretrained (#4801) 1 year ago
test_legacy [npu] add npu support for gemini and zero (#5067) 1 year ago
test_moe [hotfix]: modify create_ep_hierarchical_group and add test (#5032) 1 year ago
test_optimizer [test] merge old components to test to model zoo (#4945) 1 year ago
test_pipeline [misc] update pre-commit and run all files (#4752) 1 year ago
test_shardformer [gemini] gemini support extra-dp (#5043) 1 year ago
test_smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
test_tensor [misc] update pre-commit and run all files (#4752) 1 year ago
test_utils [misc] update pre-commit and run all files (#4752) 1 year ago
test_zero [npu] add npu support for gemini and zero (#5067) 1 year ago
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago