Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago
..
gptq [inference] Refactor inference architecture (#5057) 1 year ago
smoothquant [inference] Refactor inference architecture (#5057) 1 year ago