ColossalAI/examples/inference
Xu Kai 20332a7a34
[inference] udpate example (#5053)
* udpate example

* fix run.sh
2023-11-16 11:07:43 +08:00
_utils.py                    [Inference]ADD Bench Chatglm2 script (#4963)            2023-10-24 13:11:15 +08:00
benchmark.py                 [inference] udpate example (#5053)                      2023-11-16 11:07:43 +08:00
hybrid_gptq_llama.py         [refactor] refactor gptq and smoothquant llama (#5012)  2023-11-09 10:12:11 +08:00
hybrid_llama.py              [inference] udpate example (#5053)                      2023-11-16 11:07:43 +08:00
hybrid_smoothquant_llama.py  [refactor] refactor gptq and smoothquant llama (#5012)  2023-11-09 10:12:11 +08:00
run.sh                       [inference] udpate example (#5053)                      2023-11-16 11:07:43 +08:00
smoothquant_llama.py         [inference] Add smmoothquant for llama (#4904)          2023-10-16 11:28:44 +08:00