ColossalAI/colossalai/inference/modeling
char-1ee 04386d9eff Refactor modeling by adding attention backend
Signed-off-by: char-1ee <xingjianli59@gmail.com>
2024-06-07 08:33:47 +00:00
..
backends Refactor modeling by adding attention backend 2024-06-07 08:33:47 +00:00
layers [Inference] Adapt Baichuan2-13B TP (#5659) 2024-04-30 15:47:07 +08:00
models Refactor modeling by adding attention backend 2024-06-07 08:33:47 +00:00
policy [Feat]Inference RPC Server Support (#5705) 2024-05-14 10:00:55 +08:00
__init__.py [doc] updated inference readme (#5343) 2024-02-02 14:31:10 +08:00