mirror of https://github.com/hpcaitech/ColossalAI
* remove useless code
* fix quant model
* fix test import bug
* mv original inference legacy
* fix chatglm2

| Name |
|---|
| serving |
| _utils.py |
| bench_bloom.py |
| bench_chatglm2.py |
| bench_llama.py |
| benchmark.py |
| gptq_bloom.py |
| gptq_llama.py |
| hybrid_gptq_llama.py |
| hybrid_smoothquant_llama.py |
| run.sh |
| smoothquant_llama.py |