ColossalAI/colossalai/zero
hxwang f45f8a2aa7 [gemini] maxprefetch means maximum work to keep 2024-05-16 16:12:53 +08:00
..
gemini [gemini] maxprefetch means maximum work to keep 2024-05-16 16:12:53 +08:00
low_level [Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694) 2024-05-14 13:52:45 +08:00
__init__.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
wrapper.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00