ColossalAI/colossalai/zero/gemini
botbw 8e718a1421
[gemini] fixes for benchmarking (#5847)
* [gemini] fix missing return

* [gemini] fix missing arg pass

* [gemini] use gather tensor instead of list

* [test] enable flash attention for benchmark by default

---------

Co-authored-by: genghaozhe <939857490@qq.com>
2024-06-26 15:52:09 +08:00
Name | Last commit | Date
chunk | [gemini] fixes for benchmarking (#5847) | 2024-06-26 15:52:09 +08:00
memory_tracer | [npu] change device to accelerator api (#5239) | 2024-01-09 10:20:05 +08:00
__init__.py | [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) | 2023-11-28 16:54:42 +08:00
gemini_ddp.py | [gemini] fixes for benchmarking (#5847) | 2024-06-26 15:52:09 +08:00
gemini_hook.py | [gemini] quick fix on possible async operation (#5803) | 2024-06-13 10:35:17 +08:00
gemini_mgr.py | [chore] remove unnecessary assert since compute list might not be recorded | 2024-05-28 05:16:02 +00:00
gemini_optimizer.py | [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713) | 2024-05-24 10:31:16 +08:00
placement_policy.py | [bug] continue fix | 2024-05-28 02:41:23 +00:00
utils.py | [npu] change device to accelerator api (#5239) | 2024-01-09 10:20:05 +08:00