4 Commits (8ec24b6a4d0e0dbec7da39e43c3c1b2cfcb0395d)

Author | SHA1 | Message | Date
Runyu Lu | 18d67d0e8e | [Feat]Inference RPC Server Support (#5705) | 7 months ago
Yuanheng Zhao | 3de2e62299 | [Inference] Add CacheBlock and KV-Cache Manager (#5156) | 11 months ago
Jianghai | 4cf4682e70 | [Inference] First PR for rebuild colossal-infer (#5143) | 11 months ago
Xu Kai | fd6482ad8c | [inference] Refactor inference architecture (#5057) | 1 year ago