Commit Graph

2965 Commits (ea088b5f75e9c9a79d67b370286da2a1508688c8)
 

Author SHA1 Message Date
Tong Li ea088b5f75 update train code
11 months ago
Tong Li 4b7f273022 add moe
11 months ago
ver217 63ee6fffe6 Merge branch 'main' into exp/mixtral
11 months ago
ver217 ce1cff26bd Merge branch 'main' into exp/mixtral
11 months ago
Elsa Granger d565df3821
[pipeline] A more general _communicate in p2p (#5062)
11 months ago
binmakeswell 7bc6969ce6
[doc] SwiftInfer release (#5236)
11 months ago
github-actions[bot] 4fb4a22a72
[format] applied code formatting on changed files in pull request 5234 (#5235)
11 months ago
binmakeswell b9b32b15e6
[doc] add Colossal-LLaMA-2-13B (#5234)
11 months ago
JIMMY ZHAO ce651270f1
[doc] Make leaderboard format more uniform and good-looking (#5231)
11 months ago
Camille Zhong 915b4652f3
[doc] Update README.md of Colossal-LLAMA2 (#5233)
11 months ago
Tong Li d992b55968
[Colossal-LLaMA-2] Release Colossal-LLaMA-2-13b-base model (#5224)
11 months ago
Wenhao Chen 196b85368b [pipeline]: add p2p fallback order and fix interleaved pp deadlock (#5214)
11 months ago
Wenhao Chen 931d0e0731 [pipeline]: support arbitrary batch size in forward_only mode (#5201)
11 months ago
Wenhao Chen 1810b9100f [pipeline]: fix p2p comm, add metadata cache and support llama interleaved pp (#5134)
11 months ago
digger yu b0b53a171c
[nfc] fix typo colossalai/shardformer/ (#5133)
11 months ago
Xuanlei Zhao 6b69f3085b update
11 months ago
flybird11111 451e9142b8
fix flash attn (#5209)
11 months ago
flybird11111 365671be10
fix-test (#5210)
11 months ago
Xuanlei Zhao 8ca8cf8ec3 update optim
11 months ago
Hongxin Liu 7f3400b560
[devops] update torch versoin in ci (#5217)
11 months ago
Wenhao Chen d799a3088f
[pipeline]: add p2p fallback order and fix interleaved pp deadlock (#5214)
11 months ago
Wenhao Chen 3c0d82b19b
[pipeline]: support arbitrary batch size in forward_only mode (#5201)
11 months ago
Xuanlei Zhao f037583bd2 update train
11 months ago
flybird11111 02d2328a04
support linear accumulation fusion (#5199)
11 months ago
Xuanlei Zhao 0b8c33f474 update
11 months ago
Xuanlei Zhao c1c6af6368 update
11 months ago
Xuanlei Zhao 0bb317d9e6 update
11 months ago
Xuanlei Zhao ccad7014c6 update optim
11 months ago
Xuanlei Zhao 44014faa67 fix optim
11 months ago
Xuanlei Zhao 0a3aae509b update utils and fwd bwd
11 months ago
Xuanlei Zhao a5580e6289 update test
11 months ago
Xuanlei Zhao 73aa406b96 update
11 months ago
Zhongkai Zhao 64519eb830
[doc] Update required third-party library list for testing and torch comptibility checking (#5207)
11 months ago
Xuanlei Zhao 570f5cd693 update pytest
11 months ago
Xuanlei Zhao 54b197cc02 update readme
11 months ago
Xuanlei Zhao 4922641098 script
11 months ago
Xuanlei Zhao d660a41850 update
11 months ago
Xuanlei Zhao b8fadb68a7 add pad
11 months ago
Xuanlei Zhao 23341687ed update
11 months ago
Xuanlei Zhao aa2e091dc6 update
11 months ago
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
11 months ago
Wenhao Chen 4fa689fca1
[pipeline]: fix p2p comm, add metadata cache and support llama interleaved pp (#5134)
11 months ago
BlueRum af952673f7
polish readme in application/chat (#5194)
11 months ago
Xuanlei Zhao 7c5b1a585f update
12 months ago
flybird11111 681d9b12ef
[doc] update pytorch version in documents. (#5177)
12 months ago
Xuanlei Zhao ebd8cc579a update script
12 months ago
Xuanlei Zhao f66469e209 update
12 months ago
Yuanchen 3ff60d13b0
Fix ColossalEval (#5186)
12 months ago
Xuanlei Zhao 8aef2dba02 init
12 months ago
flybird11111 79718fae04
[shardformer] llama support DistCrossEntropy (#5176)
12 months ago