ColossalAI

mirror of https://github.com/hpcaitech/ColossalAI

History

Yuanheng Zhao 7b249c76e5 [Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837 ) * fix glide llama model * revise		2024-06-19 15:37:53 +08:00
..
benchmark_ops	add paged-attetionv2: support seq length split across thread block (#5707 )	2024-05-14 12:46:54 +08:00
client	[Inference]Fix readme and example for API server (#5742 )	2024-05-24 10:03:05 +08:00
llama	[Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837 )	2024-06-19 15:37:53 +08:00