Commit Graph

4 Commits (810cafb2f987cac2bbe99ef491455921f197f315)

Author SHA1 Message Date
Runyu Lu 18d67d0e8e
[Feat]Inference RPC Server Support (#5705)
7 months ago
Yuanheng Zhao 3de2e62299 [Inference] Add CacheBlock and KV-Cache Manager (#5156)
11 months ago
Jianghai 4cf4682e70 [Inference] First PR for rebuild colossal-infer (#5143)
11 months ago
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago