ColossalAI/tests/test_infer
Jianghai 1f8c7e7046
[Inference] User Experience: update the logic of default tokenizer and generation config. (#5337)
* add

* fix

* fix

* pause

* fix

* fix pytest

* align

* fix

* license

* fix

* fix

* fix readme

* fix some bugs

* remove tokenizer config
2024-02-07 17:55:48 +08:00
..
test_models [Infer] Optimize Blocked KVCache And Kernels Using It (#5325) 2024-01-30 16:06:09 +08:00
test_ops/triton [Inference/opt] Fused KVCahce Memcopy (#5374) 2024-02-07 17:15:42 +08:00
_utils.py [Inference] Add the logic of the inference engine (#5173) 2024-01-11 13:39:56 +00:00
test_config_and_struct.py [inference] moved ops tests to test_infer (#5354) 2024-02-02 13:51:22 +08:00
test_inference_engine.py [Inference] User Experience: update the logic of default tokenizer and generation config. (#5337) 2024-02-07 17:55:48 +08:00
test_kvcache_manager.py [Infer] Optimize Blocked KVCache And Kernels Using It (#5325) 2024-01-30 16:06:09 +08:00
test_request_handler.py [inference]Optimize the usage of the mid tensors space in flash attn (#5304) 2024-01-26 14:00:10 +08:00