ColossalAI/colossalai/inference/quant/gptq
Xu Kai 450115bd0f [refactor] refactor gptq and smoothquant llama (#5012)
* refactor gptq and smoothquant llama

* fix import error

* fix linear import torch-int

* fix smoothquant llama import error

* fix import accelerate error

* fix bug

* fix import smooth cuda

* fix smoothcuda
2023-11-09 10:12:11 +08:00
..
cai_gptq [NFC] polish colossalai/inference/quant/gptq/cai_gptq/__init__.py code style (#4792) 2023-10-07 13:36:52 +08:00
__init__.py [refactor] refactor gptq and smoothquant llama (#5012) 2023-11-09 10:12:11 +08:00
gptq_manager.py [refactor] refactor gptq and smoothquant llama (#5012) 2023-11-09 10:12:11 +08:00