ColossalAI/colossalai/inference/modeling/policy
Runyu Lu 18d67d0e8e
[Feat]Inference RPC Server Support (#5705)
* rpc support source
* kv cache logical/physical disaggregation
* sampler refactor
* colossalai launch built in
* Unitest
* Rpyc support

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-14 10:00:55 +08:00
..
__init__.py [inference/model]Adapted to the baichuan2-7B model (#5591) 2024-04-15 16:53:02 +08:00
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
nopadding_baichuan.py [Feat]Inference RPC Server Support (#5705) 2024-05-14 10:00:55 +08:00
nopadding_llama.py [Feat]Inference RPC Server Support (#5705) 2024-05-14 10:00:55 +08:00