Commit Graph

3 Commits (c7d68b2c2ca3f7fd32056ea952fae4fe239f75ea)

Author SHA1 Message Date
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752)
1 year ago
Cuiqing Li bce0f16702
[Feature] The first PR to Add TP inference engine, kv-cache manager and related kernels for our inference system (#4577)
1 year ago