Making large AI models cheaper, faster and more accessible
Latest commit: botbw 8e718a1421 [gemini] fixes for benchmarking (#5847), 5 months ago
chunk                 [gemini] fixes for benchmarking (#5847)  5 months ago
memory_tracer         [npu] change device to accelerator api (#5239)  11 months ago
__init__.py           [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)  1 year ago
gemini_ddp.py         [gemini] fixes for benchmarking (#5847)  5 months ago
gemini_hook.py        [gemini] quick fix on possible async operation (#5803)  5 months ago
gemini_mgr.py         [chore] remove unnecessary assert since compute list might not be recorded  6 months ago
gemini_optimizer.py   [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713)  6 months ago
placement_policy.py   [bug] continue fix  6 months ago
utils.py              [npu] change device to accelerator api (#5239)  11 months ago