ColossalAI/colossalai/inference/modeling
Runyu Lu 3c7cda0c9a
[Inference]Lazy Init Support (#5785)
* lazy init support

* lazy init llama support

* lazy init support for baichuan

* align rpc

* add note for baichuan

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-06-27 18:02:15 +08:00
backends [Inference] Fix flash-attn import and add model test (#5794) 2024-06-12 14:13:50 +08:00
layers [Inference]Lazy Init Support (#5785) 2024-06-27 18:02:15 +08:00
models [Inference]Lazy Init Support (#5785) 2024-06-27 18:02:15 +08:00
policy [Inference]refactor baichuan (#5791) 2024-06-11 10:52:01 +08:00
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00