3857 Commits (5b5fbcff09092ccecf54dde05dc6ee25235d98b2)
 

Author SHA1 Message Date
wangbluo d891e50617 fix 1 month ago
wangbluo e1e86f9f1f fix 1 month ago
Tong Li 4c8e85ee0d
[Coati] Train DPO using PP (#6054) 1 month ago
wangbluo 703bb5c18d fix the test 1 month ago
wangbluo 4e0e99bb6a fix the test 1 month ago
duanjunwen 0ca16d5cbe [fix] fix llama, mixtral benchmark zbv loss none bug; update mixtral & llama policy and modeling; 1 month ago
wangbluo 1507a7528f fix 1 month ago
wangbluo 0002ae5956 fix 1 month ago
flybird11111 dac0e07b13
[zero bubble] support zero (#6080) 1 month ago
Hongxin Liu dc2cdaf3e8
[shardformer] optimize seq parallelism (#6086) 1 month ago
wangbluo efe3042bb2 fix 1 month ago
梁爽 6b2c506fc5
Update README.md (#6087) 1 month ago
wangbluo 5ecc27e150 fix 1 month ago
wangbluo f98384aef6 fix 1 month ago
duanjunwen e234dfa236 [feat] support MixtralPipelineForwards--> mixtral_for_causal_lm_forward for zbv 1 month ago
Hongxin Liu 646b3c5a90
[shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084) 1 month ago
duanjunwen 72b507a7be [feat] update MixtralPipelineForwards --> mixtral_model_forward; support zbv; 1 month ago
duanjunwen 9ee80fc828 [fix] MixtralForCausalLMPolicy get_held_layer support zbv; 1 month ago
wangbluo b635dd0669 fix 1 month ago
duanjunwen 3f5bec8dc4 [feat] support zbv in mixtral benchmark; 1 month ago
wangbluo 3532f77b90 fix 1 month ago
duanjunwen 531773ff54
Merge pull request #6077 from duanjunwen/dev/zero_bubble 1 month ago
duanjunwen cc500b3e25 [fix] fix mixtral policy; 2 months ago
duanjunwen 292a504bea [fix] fix mixtral policy; 2 months ago
duanjunwen f4d023ca6e Merge branch 'feature/zerobubble' of github.com:hpcaitech/ColossalAI into dev/zero_bubble 2 months ago
flybird11111 295dd2d9fe
[zerobubble] rebase main (#6075) 2 months ago
duanjunwen 6975c50f78 [fix] fix build ci; 2 months ago
duanjunwen 5c8bbf63a8 [feat] update optimizer bwd; ä¸ 2 months ago
duanjunwen d63479553c [feat] zerobubble support moehybridplugin; 2 months ago
flybird11111 af6aa9ed06
[plugin] hybrid support zero bubble pipeline (#6060) 2 months ago
duanjunwen b804fdc297
Merge pull request #6069 from duanjunwen/dev/zero_bubble 2 months ago
duanjunwen 1342a983b1 [fix] rm print & comments; 2 months ago
duanjunwen 64ceea746f [fix] remove chunk 0 stage 0 bwd b; u don't have to cal micrbatch's dx; 2 months ago
wangbluo 3fab92166e fix 2 months ago
duanjunwen bb0390c90d [fix] remove duplicate arg; rm comments; 2 months ago
duanjunwen c5503b0d80 [fix] fix test_pipeline_utils ci; 2 months ago
duanjunwen 45f17fc6cc [fix] rm comments; 2 months ago
duanjunwen a92e16719b [fix] fix zerobubble; support shardformer model type; 2 months ago
binmakeswell f4daf04270
add funding news (#6072) 2 months ago
wangbluo 6705dad41b fix 2 months ago
wangbluo 91ed32c256 fix 2 months ago
wangbluo 6fb1322db1 fix 2 months ago
wangbluo 65c8297710 fix the attn 2 months ago
wangbluo cfd9eda628 fix the ring attn 2 months ago
duanjunwen 83163fa70c [fix] fix traverse; traverse dict --> traverse tensor List; 2 months ago
duanjunwen fc8b016887 [fix] fix stage_indices; 2 months ago
binmakeswell cbaa104216
release FP8 news (#6068) 2 months ago
duanjunwen 8501202a35
Merge pull request #6065 from duanjunwen/dev/zero_bubble 2 months ago
duanjunwen 7e6f793c51 [fix] fix detach_output_obj clone; 2 months ago
duanjunwen 6c1e1550ae [fix] fix dumb clone; 2 months ago