mirror of https://github.com/hpcaitech/ColossalAI
a37f82629d
* fix flash decoding mask during verification * add spec-dec * add test for spec-dec * revise drafter init * remove drafter sampling * retire past kv in drafter * (trivial) rename attrs * (trivial) rename arg * revise how we enable/disable spec-dec |
||
---|---|---|
.. | ||
__init__.py | ||
context_attn_unpad.py | ||
flash_decoding.py | ||
fused_rotary_embedding.py | ||
kvcache_copy.py | ||
llama_act_combine_kernel.py | ||
no_pad_rotary_embedding.py | ||
qkv_matmul_kernel.py | ||
rms_layernorm.py | ||
rotary_cache_copy.py | ||
softmax.py |