Making large AI models cheaper, faster and more accessible
 
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago
__init__.py
batch_infer_state.py
kvcache_manager.py