ColossalAI/examples/inference
Jianghai 85946d4236
[Inference]Fix readme and example for API server (#5742)
* fix chatapi readme and example

* updating doc

* add an api and change the doc

* remove

* add credits and del 'API' heading

* readme

* readme
2024-05-24 10:03:05 +08:00
..
benchmark_ops add paged-attetionv2: support seq length split across thread block (#5707) 2024-05-14 12:46:54 +08:00
client [Inference]Fix readme and example for API server (#5742) 2024-05-24 10:03:05 +08:00
llama [example] Update Inference Example (#5725) 2024-05-17 11:28:53 +08:00