ColossalAI/colossalai/inference/modeling/policy
yuehuayingxueluo 5f00002e43
[Inference] Adapt Baichuan2-13B TP (#5659)
* adapt to baichuan2 13B

* add baichuan2 13B TP

* update baichuan tp logic

* rm unused code

* Fix TP logic

* fix alibi slopes tp logic

* rm nn.Module

* Polished the code.

* change BAICHUAN_MODEL_NAME_OR_PATH

* Modified the logic for loading Baichuan weights.

* fix typos
2024-04-30 15:47:07 +08:00
..
__init__.py [inference/model]Adapted to the baichuan2-7B model (#5591) 2024-04-15 16:53:02 +08:00
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
nopadding_baichuan.py [Inference] Adapt Baichuan2-13B TP (#5659) 2024-04-30 15:47:07 +08:00
nopadding_llama.py [example] Update Llama Inference example (#5629) 2024-04-23 22:23:07 +08:00