ColossalAI/colossalai/kernel
yuehuayingxueluo 3c91e3f176
[Inference]Adapt to baichuan2 13B (#5614)
* adapt to baichuan2 13B

* adapt to baichuan2 13B

* change BAICHUAN_MODEL_NAME_OR_PATH

* fix test_decoding_attn.py

* Modifications based on review comments.

* change BAICHUAN_MODEL_NAME_OR_PATH

* mv attn mask processes to test flash decoding

* mv get_alibi_slopes baichuan modeling

* fix bugs in test_baichuan.py
2024-04-25 23:11:30 +08:00
Name              Last commit                                     Date
jit               [npu] change device to accelerator api (#5239)  2024-01-09 10:20:05 +08:00
triton            [Inference]Adapt to baichuan2 13B (#5614)       2024-04-25 23:11:30 +08:00
__init__.py       [feat] refactored extension module (#5298)      2024-01-25 17:01:48 +08:00
extensions        [feat] refactored extension module (#5298)      2024-01-25 17:01:48 +08:00
kernel_loader.py  [Fix] resolve conflicts of merging main         2024-04-08 16:21:47 +08:00