mirror of https://github.com/hpcaitech/ColossalAI
![]() * merge kvcache with pipeline inference and refactor the code structure * support ppsize > 2 * refactor pipeline code * do pre-commit * modify benchmark * fix bench mark * polish code * add docstring and update readme * refactor the code * fix some logic bug of ppinfer * polish readme * fix typo * skip infer test |
||
---|---|---|
.. | ||
modeling | ||
policies | ||
__init__.py | ||
batch_infer_state.py | ||
engine.py | ||
kvcache_manager.py |