Making large AI models cheaper, faster and more accessible
Latest commit: Hongxin Liu, 5ddad486ca, [fp8] add fallback and make compile option configurable (#6092), 1 month ago
File           Last commit                                                        Updated
__init__.py    [Feature] qlora support (#5586)                                    7 months ago
bnb.py         [quant] fix bitsandbytes version check (#5882)                     5 months ago
bnb_config.py  [Feature] qlora support (#5586)                                    7 months ago
fp8.py         [fp8] add fallback and make compile option configurable (#6092)    1 month ago
fp8_config.py  [fp8] add fallback and make compile option configurable (#6092)    1 month ago
fp8_hook.py    [fp8] support gemini plugin (#5978)                                4 months ago
utils.py       [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928)  4 months ago