ColossalAI/colossalai/inference/modeling/policy
Runyu Lu c0948aff97
[Inference]refactor baichuan (#5791)
* refactor baichuan

* remove unused code and add TODO for lazyinit
2024-06-11 10:52:01 +08:00
__init__.py [inference/model]Adapted to the baichuan2-7B model (#5591) 2024-04-15 16:53:02 +08:00
glide_llama.py [Inference/SpecDec] Support GLIDE Drafter Model (#5455) 2024-04-10 11:07:52 +08:00
nopadding_baichuan.py [Inference]refactor baichuan (#5791) 2024-06-11 10:52:01 +08:00
nopadding_llama.py Pass inference model shard configs for module init 2024-06-07 08:33:52 +00:00