mirror of https://github.com/hpcaitech/ColossalAI
a37f82629d
* fix flash decoding mask during verification * add spec-dec * add test for spec-dec * revise drafter init * remove drafter sampling * retire past kv in drafter * (trivial) rename attrs * (trivial) rename arg * revise how we enable/disable spec-dec |
||
---|---|---|
.. | ||
jit | ||
triton | ||
__init__.py | ||
extensions | ||
kernel_loader.py |