3742 Commits (dee63cc5ef7e6147c7390a5a3cfdab9e421322e8)
 

Author SHA1 Message Date
Hanks dee63cc5ef
Merge pull request #6096 from BurkeHulk/hotfix/lora_ckpt 1 month ago
BurkeHulk 6d6cafabe2 pre-commit fix 1 month ago
BurkeHulk b10339df7c fix lora ckpt save format (ColoTensor to Tensor) 1 month ago
Hongxin Liu 19baab5fd5
[release] update version (#6094) 1 month ago
Hongxin Liu 58d8b8a2dd
[misc] fit torch api upgradation and remove legecy import (#6093) 1 month ago
Hongxin Liu 5ddad486ca
[fp8] add fallback and make compile option configurable (#6092) 1 month ago
botbw 3b1d7d1ae8 [chore] refactor 1 month ago
botbw 2bcd0b6844 [ckpt] add safetensors util 1 month ago
Hongxin Liu cd61353bae
[pipeline] hotfix backward for multiple outputs (#6090) 1 month ago
Wenxuan Tan 62c13e7969
[Ring Attention] Improve comments (#6085) 1 month ago
Wang Binluo dcd41d0973
Merge pull request #6071 from wangbluo/ring_attention 1 month ago
wangbluo 83cf2f84fb fix 1 month ago
wangbluo bc7eeade33 fix 1 month ago
wangbluo fd92789af2 fix 1 month ago
wangbluo 6be9862aaf fix 1 month ago
wangbluo 3dc08c8a5a fix 1 month ago
wangbluo 8ff7d0c780 fix 1 month ago
wangbluo fe9208feac fix 1 month ago
wangbluo 3201377e94 fix 1 month ago
wangbluo 23199e34cc fix 1 month ago
wangbluo d891e50617 fix 1 month ago
wangbluo e1e86f9f1f fix 1 month ago
Tong Li 4c8e85ee0d
[Coati] Train DPO using PP (#6054) 1 month ago
wangbluo 703bb5c18d fix the test 1 month ago
wangbluo 4e0e99bb6a fix the test 1 month ago
wangbluo 1507a7528f fix 1 month ago
wangbluo 0002ae5956 fix 1 month ago
Hongxin Liu dc2cdaf3e8
[shardformer] optimize seq parallelism (#6086) 1 month ago
wangbluo efe3042bb2 fix 1 month ago
梁爽 6b2c506fc5
Update README.md (#6087) 1 month ago
wangbluo 5ecc27e150 fix 1 month ago
wangbluo f98384aef6 fix 1 month ago
Hongxin Liu 646b3c5a90
[shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084) 1 month ago
wangbluo b635dd0669 fix 1 month ago
wangbluo 3532f77b90 fix 1 month ago
wangbluo 3fab92166e fix 2 months ago
binmakeswell f4daf04270
add funding news (#6072) 2 months ago
wangbluo 6705dad41b fix 2 months ago
wangbluo 91ed32c256 fix 2 months ago
wangbluo 6fb1322db1 fix 2 months ago
wangbluo 65c8297710 fix the attn 2 months ago
wangbluo cfd9eda628 fix the ring attn 2 months ago
binmakeswell cbaa104216
release FP8 news (#6068) 2 months ago
Hongxin Liu dabc2e7430
[release] update version (#6062) 2 months ago
Camille Zhong f9546ba0be
[ColossalEval] support for vllm (#6056) 2 months ago
botbw 4fa6b9509c
[moe] add parallel strategy for shared_expert && fix test for deepseek (#6063) 2 months ago
Wang Binluo 63314ce4e4
Merge pull request #6064 from wangbluo/fix_attn 2 months ago
wangbluo 10e4f7da72 fix 2 months ago
Wang Binluo 37e35230ff
Merge pull request #6061 from wangbluo/sp_fix 2 months ago
wangbluo 827ef3ee9a fix 2 months ago