You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/legacy/inference/dynamic_batching
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057)
1 year ago
..
__init__.py [inference] Refactor inference architecture (#5057) 1 year ago
get_tokenizer.py [inference] Refactor inference architecture (#5057) 1 year ago
infer_batch.py [inference] Refactor inference architecture (#5057) 1 year ago
io_struct.py [inference] Refactor inference architecture (#5057) 1 year ago
ray_dist_init.py [inference] Refactor inference architecture (#5057) 1 year ago
ray_init_config.py [inference] Refactor inference architecture (#5057) 1 year ago
req_queue.py [inference] Refactor inference architecture (#5057) 1 year ago
sampling_params.py [inference] Refactor inference architecture (#5057) 1 year ago
stats.py [inference] Refactor inference architecture (#5057) 1 year ago