Making large AI models cheaper, faster and more accessible
Latest commit: 8241c0c054 by Hongxin Liu, "[fp8] support gemini plugin (#5978)", 4 months ago
Name | Last commit | Updated
chunk | [Feature]: support FP8 communication in DDP, FSDP, Gemini (#5928) | 4 months ago
memory_tracer | [npu] change device to accelerator api (#5239) | 11 months ago
__init__.py | [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) | 1 year ago
gemini_ddp.py | [fp8] support gemini plugin (#5978) | 4 months ago
gemini_hook.py | [gemini] quick fix on possible async operation (#5803) | 5 months ago
gemini_mgr.py | [chore] remove unnecessary assert since compute list might not be recorded | 6 months ago
gemini_optimizer.py | [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713) | 6 months ago
placement_policy.py | [bug] continue fix | 6 months ago
utils.py | [npu] change device to accelerator api (#5239) | 11 months ago