Making large AI models cheaper, faster and more accessible
Li Xingjian 8554585a5f
[Inference] Fix flash-attn import and add model test (#5794)
5 months ago
test_attention.py     [Fix] Fix Inference Example, Tests, and Requirements (#5688)  7 months ago
test_baichuan.py      Pass inference model shard configs for module init            6 months ago
test_custom_model.py  [Inference] Fix flash-attn import and add model test (#5794)  5 months ago