ColossalAI/colossalai/inference/modeling/models
Runyu Lu 3c7cda0c9a
[Inference]Lazy Init Support (#5785)
* lazy init support

* lazy init llama support

* lazy init support for baichuan

* align rpc

* add note for baichuan

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-06-27 18:02:15 +08:00
__init__.py fix bugs in request_handler 2024-01-11 13:39:56 +00:00
glide_llama.py [Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837) 2024-06-19 15:37:53 +08:00
nopadding_baichuan.py [Inference]Lazy Init Support (#5785) 2024-06-27 18:02:15 +08:00
nopadding_llama.py [Inference]Lazy Init Support (#5785) 2024-06-27 18:02:15 +08:00