Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
flybird11111 a0ad587c24
[shardformer] refactor embedding resize (#5603)
7 months ago
..
kit [devops] remove post commit ci (#5566) 8 months ago
test_analyzer [misc] update pre-commit and run all files (#4752) 1 year ago
test_auto_parallel [npu] change device to accelerator api (#5239) 11 months ago
test_autochunk [misc] update pre-commit and run all files (#4752) 1 year ago
test_booster [shardformer] fix pipeline forward error if custom layer distribution is used (#5189) 8 months ago
test_checkpoint_io [shardformer] refactor embedding resize (#5603) 7 months ago
test_cluster [shardformer] Sequence Parallelism Optimization (#5533) 8 months ago
test_config [misc] update pre-commit and run all files (#4752) 1 year ago
test_device [misc] update pre-commit and run all files (#4752) 1 year ago
test_fx [misc] update pre-commit and run all files (#4752) 1 year ago
test_gptq [devops] remove post commit ci (#5566) 8 months ago
test_infer [Hotfix] Fix model policy matching strategy in ShardFormer (#5064) 1 year ago
test_lazy [devops] remove post commit ci (#5566) 8 months ago
test_legacy [npu] change device to accelerator api (#5239) 11 months ago
test_moe [hotfix] set return_outputs=False in examples and polish code (#5404) 8 months ago
test_optimizer [devops] remove post commit ci (#5566) 8 months ago
test_pipeline [devops] remove post commit ci (#5566) 8 months ago
test_shardformer [shardformer] refactor embedding resize (#5603) 7 months ago
test_smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
test_tensor [shardformer] refactor embedding resize (#5603) 7 months ago
test_zero [npu] change device to accelerator api (#5239) 11 months ago
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago