mirror of https://github.com/hpcaitech/ColossalAI
![]() * opt flash attn * opt tmp tensor * fix benchmark_llama * fix code style * fix None logic for output tensor * fix adapted to get_xine_cache * add comment * fix ci bugs * fix some codes * rm duplicated codes * rm duplicated codes * fix code style * add _get_dtype in config.py |
||
---|---|---|
.. | ||
benchmark_llama.py | ||
build_smoothquant_weight.py | ||
run_benchmark.sh | ||
run_llama_inference.py |