4 Commits (8241c0c054b38a109ed3ce7be1052a1e600b8471)

Author SHA1 Message Date
Runyu Lu 18d67d0e8e
[Feat]Inference RPC Server Support (#5705) 6 months ago
Yuanheng Zhao 3de2e62299 [Inference] Add CacheBlock and KV-Cache Manager (#5156) 11 months ago
Jianghai 4cf4682e70 [Inference] First PR for rebuild colossal-infer (#5143) 11 months ago
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057) 1 year ago