Commit Graph

6 Commits (65e5d6baa51314414a6d0a3533226e978708408c)

Author SHA1 Message Date
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago
Jianghai cf579ff46d
[Inference] Dynamic Batching Inference, online and offline (#4953)
1 year ago
Xu Kai 785802e809
[inference] add reference and fix some bugs (#4937)
1 year ago
Xu Kai 611a5a80ca
[inference] Add smmoothquant for llama (#4904)
1 year ago
Michelle 07ed155e86 [NFC] polish colossalai/inference/quant/gptq/cai_gptq/__init__.py code style (#4792)
1 year ago
Xu Kai 946ab56c48
[feature] add gptq for inference (#4754)
1 year ago