Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Hongxin Liu d202cc28c0
[npu] change device to accelerator api (#5239)
11 months ago
..
benchmark_llama.py [npu] change device to accelerator api (#5239) 11 months ago
build_smoothquant_weight.py [inference] refactor examples and fix schedule (#5077) 1 year ago
run_benchmark.sh [inference] refactor examples and fix schedule (#5077) 1 year ago
run_llama_inference.py [npu] change device to accelerator api (#5239) 11 months ago