mirror of https://github.com/hpcaitech/ColossalAI
![]() * feat flash decoding for paged attention * refactor flashdecodingattention * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> |
||
---|---|---|
.. | ||
benchmark_ops | ||
benchmark_llama.py | ||
build_smoothquant_weight.py | ||
run_benchmark.sh | ||
run_llama_inference.py |