mirror of https://github.com/hpcaitech/ColossalAI
a37f82629d
* fix flash decoding mask during verification * add spec-dec * add test for spec-dec * revise drafter init * remove drafter sampling * retire past kv in drafter * (trivial) rename attrs * (trivial) rename arg * revise how we enable/disable spec-dec | ||
---|---|---|
test_models | ||
test_ops | ||
_utils.py | ||
test_batch_bucket.py | ||
test_config_and_struct.py | ||
test_cuda_graph.py | ||
test_drafter.py | ||
test_inference_engine.py | ||
test_kvcache_manager.py | ||
test_request_handler.py |