Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Guangyao Zhang f20b066c59
[fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059)
2 months ago
..
test_all_to_all_single.py [fp8] support asynchronous FP8 communication (#5997) 3 months ago
test_fp8_all_to_all.py [fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059) 2 months ago
test_fp8_all_to_all_single.py [Feature] llama shardformer fp8 support (#5938) 4 months ago
test_fp8_allgather.py [fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059) 2 months ago
test_fp8_allreduce.py [fp8] support asynchronous FP8 communication (#5997) 3 months ago
test_fp8_cast.py [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 4 months ago
test_fp8_ddp_comm_hook.py [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 4 months ago
test_fp8_fsdp_comm_hook.py [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 4 months ago
test_fp8_hook.py [fp8] support gemini plugin (#5978) 3 months ago
test_fp8_linear.py [fp8] add fp8 linear (#5967) 4 months ago
test_fp8_reduce_scatter.py [fp8]update reduce-scatter test (#6002) 3 months ago