ColossalAI/colossalai/inference/modeling
flybird11111 2ddf624a86
[shardformer] upgrade transformers to 4.39.3 (#5815)
* [shardformer]upgrade transformers for gpt2/gptj/whisper (#5807)

* [shardformer] fix modeling of gpt2 and gptj

* [shardformer] fix whisper modeling

* [misc] update requirements

---------

Co-authored-by: ver217 <lhx0217@gmail.com>

* [shardformer]upgrade transformers for mistral (#5808)

* upgrade transformers for mistral

* fix

* fix

* [shardformer]upgrade transformers for llama (#5809)

* update transformers

fix

* fix

* fix

* [inference] upgrade transformers (#5810)

* update transformers

fix

* fix

* fix

* fix

* fix

* [gemini] update transformers for gemini (#5814)

---------

Co-authored-by: ver217 <lhx0217@gmail.com>
2024-06-14 10:59:33 +08:00
..
backends [Inference] Fix flash-attn import and add model test (#5794) 2024-06-12 14:13:50 +08:00
layers [Inference]refactor baichuan (#5791) 2024-06-11 10:52:01 +08:00
models [shardformer] upgrade transformers to 4.39.3 (#5815) 2024-06-14 10:59:33 +08:00
policy [Inference]refactor baichuan (#5791) 2024-06-11 10:52:01 +08:00
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00