You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/inference/readme.md

19 lines
426 B

# Colossal-Infer
## Introduction
Colossal-Infer is a library for inference of LLMs and MLMs. It is built on top of Colossal AI.
## Structures
### Overview
The main design will be released later on.
## Roadmap
- [] design of structures
- [] Core components
- [] engine
- [] request handler
- [] kv cache manager
- [] modeling
- [] custom layers
- [] online server
- [] supported models
- [] llama2