Making large AI models cheaper, faster and more accessible
Steve Luo 725fbd2ed0
[Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679)
7 months ago
data_type.h [Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679) 7 months ago
micros.h [Inference/Refactor] Refactor compilation mechanism and unified multi hw (#5613) 7 months ago
mp_type_traits.h [Inference/Feat] Add kvcache quant support for fused_rotary_embedding_cache_copy (#5680) 7 months ago
target.h add implementation for GetGPULaunchConfig1D 8 months ago
vec_type_traits.h [Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679) 7 months ago