ColossalAI/examples/inference
Yuanheng Zhao 56ed09aba5 [sync] resolve conflicts of merging main 2024-05-05 05:14:00 +00:00
..
benchmark_ops [kernel] Support New KCache Layout - Triton Kernel (#5677) 2024-05-03 17:20:45 +08:00
benchmark_llama.py [sync] resolve conflicts of merging main 2024-05-05 05:14:00 +00:00
benchmark_llama3.py [Fix/Inference]Fix vllm benchmark (#5630) 2024-04-24 14:51:36 +08:00
llama_generation.py [example] Update Llama Inference example (#5629) 2024-04-23 22:23:07 +08:00
run_benchmark.sh [Fix/Inference]Fix vllm benchmark (#5630) 2024-04-24 14:51:36 +08:00