mirror of https://github.com/hpcaitech/ColossalAI
* Fix bugs about OOM when running vllm-0.4.0
* rm used params
* change generation_config
* change benchmark log file name

| Name |
|---|
| benchmark_ops |
| benchmark_llama.py |
| benchmark_llama3.py |
| build_smoothquant_weight.py |
| llama_generation.py |
| run_benchmark.sh |
| run_llama_inference.py |