10 Commits (main)

Author SHA1 Message Date
Guangyao Zhang f20b066c59
[fp8] Disable all_gather intranode. Disable Redundant all_gather fp8 (#6059) 2 months ago
flybird11111 20722a8c93
[fp8]update reduce-scatter test (#6002) 3 months ago
flybird11111 597b206001
[fp8] support asynchronous FP8 communication (#5997) 3 months ago
Hongxin Liu 8241c0c054
[fp8] support gemini plugin (#5978) 3 months ago
Hanks b480eec738
[Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) 4 months ago
Hongxin Liu ccabcf6485
[fp8] support fp8 amp for hybrid parallel plugin (#5975) 4 months ago
Hongxin Liu 76ea16466f
[fp8] add fp8 linear (#5967) 4 months ago
flybird11111 afb26de873
[fp8]support all2all fp8 (#5953) 4 months ago
Guangyao Zhang 53cb9606bd
[Feature] llama shardformer fp8 support (#5938) 4 months ago
Hongxin Liu 5fd0592767
[fp8] support all-gather flat tensor (#5932) 4 months ago