Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Jianghai 85946d4236
[Inference]Fix readme and example for API server (#5742)
6 months ago
..
__init__.py [Inference] ADD async and sync Api server using FastAPI (#5396) 7 months ago
api_server.py [Inference]Fix readme and example for API server (#5742) 6 months ago
chat_service.py [Online Server] Chat Api for streaming and not streaming response (#5470) 7 months ago
completion_service.py [Inference] Fix API server, test and example (#5712) 6 months ago
utils.py [Online Server] Chat Api for streaming and not streaming response (#5470) 7 months ago