ColossalAI/colossalai/inference/quant/smoothquant
Latest commit: fd6482ad8c by Xu Kai, [inference] Refactor inference architecture (#5057), 1 year ago
models [inference] Refactor inference architecture (#5057) 1 year ago
__init__.py [inference] Add smmoothquant for llama (#4904) 1 year ago