mirror of https://github.com/hpcaitech/ColossalAI
![]() * added flash-decoidng of triton based on lightllm kernel * add req * clean * clean * delete build.sh --------- Co-authored-by: cuiqing.li <lixx336@gmail.com> |
||
---|---|---|
.. | ||
_utils.py | ||
benchmark.py | ||
hybrid_gptq_llama.py | ||
hybrid_llama.py | ||
hybrid_smoothquant_llama.py | ||
run_benchmark.sh | ||
smoothquant_llama.py |