ColossalAI/colossalai/quantization
Guangyao Zhang f20b066c59
[fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059)
* all_gather only internode, fix pytest

* fix cuda arch <89 compile pytest error

* fix pytest failure

* disable all_gather_into_tensor_flat_fp8

* fix fp8 format

* fix pytest

* fix conversations

* fix chunk tuple to list
2024-09-14 10:40:01 +08:00
__init__.py [Feature] qlora support (#5586) 2024-04-28 10:51:27 +08:00
bnb.py [quant] fix bitsandbytes version check (#5882) 2024-07-04 11:33:23 +08:00
bnb_config.py [Feature] qlora support (#5586) 2024-04-28 10:51:27 +08:00
fp8.py [fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059) 2024-09-14 10:40:01 +08:00
fp8_hook.py [fp8] support gemini plugin (#5978) 2024-08-09 14:09:48 +08:00
utils.py [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 2024-08-08 15:55:01 +08:00