You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/zero/gemini
botbw 3bcbba9262
[gemini] quick fix on possible async operation (#5803)
6 months ago
..
chunk [Gemini] Use async stream to prefetch and h2d data moving (#5781) 6 months ago
memory_tracer [npu] change device to accelerator api (#5239) 11 months ago
__init__.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
gemini_ddp.py [Gemini] Use async stream to prefetch and h2d data moving (#5781) 6 months ago
gemini_hook.py [gemini] quick fix on possible async operation (#5803) 6 months ago
gemini_mgr.py [chore] remove unnecessary assert since compute list might not be recorded 6 months ago
gemini_optimizer.py [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713) 6 months ago
placement_policy.py [bug] continue fix 6 months ago
utils.py [npu] change device to accelerator api (#5239) 11 months ago