ColossalAI/colossalai/pipeline
duanjunwen 9912cc8c07 [fix] fix bwd b; now bwd w only for Layer replaced by Linear1D_Col/Row; other layer perform a fully bwd; 2024-10-15 06:26:01 +00:00
schedule | [fix] fix bwd b; now bwd w only for Layer replaced by Linear1D_Col/Row; other layer perform a fully bwd; | 2024-10-15 06:26:01 +00:00
__init__.py | [feat] add zerobubble pp (just a frame now); add POC test for dx_dw; add test for zerobubble; | 2024-08-22 10:25:34 +00:00
p2p.py | fix object_to_tensor usage when torch>=2.3.0 (#5820) | 2024-07-16 13:59:25 +08:00
stage_manager.py | [plugin] hybrid support zero bubble pipeline (#6060) | 2024-09-27 14:48:55 +08:00
weight_grad_store.py | [feat] support use_zbv in llama, mixtral modeling; only replace Linear1D_Col/Row policy; | 2024-10-14 07:12:14 +00:00