ColossalAI/colossalai/nn/parallel
HELSON 2458659919
[zero] fix error for BEiT models (#2169)
* [zero] fix error for BEiT models

* [ColoParameter] add unpack operation for tuple arguments

* fix bugs

* fix chunkv2 unit testing

* add assertion for gradient state
2022-12-26 15:03:54 +08:00
layers              [embedding] rename FreqAwareEmbedding -> CachedEmbedding (#1699)  2022-10-13 22:22:27 +08:00
__init__.py         [Gemini] make gemini usage simple (#1821)  2022-11-08 15:53:13 +08:00
data_parallel.py    [zero] fix error for BEiT models (#2169)  2022-12-26 15:03:54 +08:00
gemini_parallel.py  [Gemini] chunk init using runtime visited param order (#2115)  2022-12-12 18:06:16 +08:00
reducer.py          [ddp] ColoDDP uses bucket all-reduce (#1177)  2022-06-29 10:34:13 +08:00
utils.py            [Gemini] GeminiDPP convert to PyTorch Module. (#2151)  2022-12-20 10:19:36 +08:00