ColossalAI/colossalai/inference/modeling/models
Jianghai 1f8c7e7046
[Inference] User Experience: update the logic of default tokenizer and generation config. (#5337)
* add

* fix

* fix

* pause

* fix

* fix pytest

* align

* fix

* license

* fix

* fix

* fix readme

* fix some bugs

* remove tokenizer config
2024-02-07 17:55:48 +08:00
..
__init__.py fix bugs in request_handler 2024-01-11 13:39:56 +00:00
nopadding_llama.py [Inference] User Experience: update the logic of default tokenizer and generation config. (#5337) 2024-02-07 17:55:48 +08:00
padding_llama.py [Inference/opt] Fused KVCahce Memcopy (#5374) 2024-02-07 17:15:42 +08:00