Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
flybird11111 5e16bf7980
[shardformer] fix gathering output when using tensor parallelism (#5431)
8 months ago
..
kit [example]add gpt2 benchmark example script. (#5295) 9 months ago
test_analyzer [misc] update pre-commit and run all files (#4752) 1 year ago
test_auto_parallel [npu] change device to accelerator api (#5239) 11 months ago
test_autochunk [misc] update pre-commit and run all files (#4752) 1 year ago
test_booster [shardformer] fix gathering output when using tensor parallelism (#5431) 8 months ago
test_checkpoint_io [devops] fix compatibility (#5444) 8 months ago
test_cluster [misc] update pre-commit and run all files (#4752) 1 year ago
test_config [misc] update pre-commit and run all files (#4752) 1 year ago
test_device [misc] update pre-commit and run all files (#4752) 1 year ago
test_fx [misc] update pre-commit and run all files (#4752) 1 year ago
test_gptq [feature] add gptq for inference (#4754) 1 year ago
test_infer [Hotfix] Fix model policy matching strategy in ShardFormer (#5064) 1 year ago
test_lazy [example]add gpt2 benchmark example script. (#5295) 9 months ago
test_legacy [npu] change device to accelerator api (#5239) 11 months ago
test_moe [moe] fix tests 10 months ago
test_optimizer [lr-scheduler] fix load state dict and add test (#5369) 10 months ago
test_pipeline Merge branch 'main' into sync/npu 10 months ago
test_shardformer [devops] fix compatibility (#5444) 8 months ago
test_smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
test_tensor [misc] update pre-commit and run all files (#4752) 1 year ago
test_utils [feat] refactored extension module (#5298) 10 months ago
test_zero [npu] change device to accelerator api (#5239) 11 months ago
__init__.py