ColossalAI/colossalai/zero
Hongxin Liu a2596519fd
[zero] support extra dp (#6123)
* [zero] support extra dp

* [zero] update checkpoint

* fix bugs

* fix bugs
2024-11-12 11:20:46 +08:00
..
gemini [plugin] support get_grad_norm (#6115) 2024-11-05 18:12:47 +08:00
low_level [zero] support extra dp (#6123) 2024-11-12 11:20:46 +08:00
__init__.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
wrapper.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00