ColossalAI/colossalai/quantization
Hanks 5ce6dd75bf
[fp8] disable all_to_all_fp8 in intranode (#6045)
* enhance all_to_all_fp8 with internode comm control

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* disable some fp8 ops due to performance issue

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-09 13:47:17 +08:00
..
__init__.py [Feature] qlora support (#5586) 2024-04-28 10:51:27 +08:00
bnb.py [quant] fix bitsandbytes version check (#5882) 2024-07-04 11:33:23 +08:00
bnb_config.py [Feature] qlora support (#5586) 2024-04-28 10:51:27 +08:00
fp8.py [fp8] disable all_to_all_fp8 in intranode (#6045) 2024-09-09 13:47:17 +08:00
fp8_hook.py [fp8] support gemini plugin (#5978) 2024-08-09 14:09:48 +08:00
utils.py [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 2024-08-08 15:55:01 +08:00