mirror of https://github.com/hpcaitech/ColossalAI
1cd7efc520
* [setup] refactor infer setup * [hotfix] fix inference behavior on 1 1 gpu * [example] refactor inference examples |
---|---|---|
benchmark_llama.py | ||
build_smoothquant_weight.py | ||
run_benchmark.sh | ||
run_llama_inference.py |