3 Commits (1fa8c5e09ff7422c30fe7683beb209bfba7e153b)

Author SHA1 Message Date
Xu Kai 946ab56c48
[feature] add gptq for inference (#4754) 1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
Cuiqing Li bce0f16702
[Feature] The first PR to Add TP inference engine, kv-cache manager and related kernels for our inference system (#4577) 1 year ago