ColossalAI/colossalai/inference/modeling
Yuanheng Zhao 7b249c76e5
[Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837)
* fix glide llama model

* revise
2024-06-19 15:37:53 +08:00
backends    | [Inference] Fix flash-attn import and add model test (#5794)                     | 2024-06-12 14:13:50 +08:00
layers      | [Inference]refactor baichuan (#5791)                                             | 2024-06-11 10:52:01 +08:00
models      | [Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837) | 2024-06-19 15:37:53 +08:00
policy      | [Inference]refactor baichuan (#5791)                                             | 2024-06-11 10:52:01 +08:00
__init__.py | [doc] updated inference readme (#5343)                                           | 2024-02-02 14:31:10 +08:00