3 Commits (c6295c3381535ed3098d12a72430659a49064a68)

Author SHA1 Message Date
Xu Kai c6295c3381 [Refactor] remove useless inference code (#5022)
* remove useless code

* fix quant model

* fix test import bug

* mv original inference legacy

* fix chatglm2
2023-11-10 14:47:06 +08:00
Xu Kai 450115bd0f [refactor] refactor gptq and smoothquant llama (#5012)
* refactor gptq and smoothquant llama

* fix import error

* fix linear import torch-int

* fix smoothquant llama import error

* fix import accelerate error

* fix bug

* fix import smooth cuda

* fix smoothcuda
2023-11-09 10:12:11 +08:00
Bin Jia 48d0a58d10 add support for bloom (#5008) 2023-11-09 10:12:11 +08:00