This website requires JavaScript.
Explore
关于
Help
Register
Sign In
github
/
ColossalAI
mirror of
https://github.com/hpcaitech/ColossalAI
Watch
1
Star
0
Fork
You've already forked ColossalAI
0
Code
Issues
Projects
Releases
Wiki
Activity
5bbab1533a
ColossalAI
/
colossalai
/
inference
/
modeling
History
Steve Luo
7806842f2d
add paged-attetionv2: support seq length split across thread block (
#5707
)
2024-05-14 12:46:54 +08:00
..
layers
[Inference] Adapt Baichuan2-13B TP (
#5659
)
2024-04-30 15:47:07 +08:00
models
add paged-attetionv2: support seq length split across thread block (
#5707
)
2024-05-14 12:46:54 +08:00
policy
[Feat]Inference RPC Server Support (
#5705
)
2024-05-14 10:00:55 +08:00
__init__.py
[doc] updated inference readme (
#5343
)
2024-02-02 14:31:10 +08:00