Commit Graph

2690 Commits (c554b7f559b592c4d358db677c87658b11a6341c)
 

Author SHA1 Message Date
Wenhao Chen 6d41c3f2aa
[doc] update Coati README (#4405)
1 year ago
LuGY d86ddd9b29
[hotfix] fix unsafe async comm in zero (#4404)
1 year ago
Baizhou Zhang 6ccecc0c69
[gemini] fix tensor storage cleaning in state dict collection (#4396)
1 year ago
flybird1111 458ae331ad
[kernel] updated unittests for coloattention (#4389)
1 year ago
binmakeswell 089c365fa0
[doc] add Series A Funding and NeurIPS news (#4377)
1 year ago
flybird1111 f40b718959
[doc] Fix gradient accumulation doc. (#4349)
1 year ago
flybird1111 38b792aab2
[coloattention] fix import error (#4380)
1 year ago
flybird1111 25c57b9fb4
[fix] coloattention support flash attention 2 (#4347)
1 year ago
Wenhao Chen da4f7b855f
[chat] fix bugs and add unit tests (#4213)
1 year ago
Hongxin Liu 16bf4c0221
[test] remove useless tests (#4359)
1 year ago
caption 16c0acc01b
[hotfix] update gradio 3.11 to 3.34.0 (#4329)
1 year ago
Hongxin Liu 806477121d
[release] update version (#4332)
1 year ago
Wenhao Chen 75c5389037
[chat] fix compute_approx_kl (#4338)
1 year ago
LuGY 03654c0ce2
fix localhost measurement (#4320)
1 year ago
LuGY 45b08f08cb [zero] optimize the optimizer step time (#4221)
1 year ago
LuGY 1a49a5ea00 [zero] support shard optimizer state dict of zero (#4194)
1 year ago
LuGY dd7cc58299 [zero] add state dict for low level zero (#4179)
1 year ago
LuGY c668801d36 [zero] allow passing process group to zero12 (#4153)
1 year ago
LuGY 79cf1b5f33 [zero]support no_sync method for zero1 plugin (#4138)
1 year ago
LuGY c6ab96983a [zero] refactor low level zero for shard evenly (#4030)
1 year ago
Yuanchen 5187c96b7c
support session-based training (#4313)
1 year ago
binmakeswell ef4b99ebcd add llama example CI
1 year ago
yuxuan-lou 0991405361 [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277)
1 year ago
Zirui Zhu 9e512938f6 [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278)
1 year ago
Ziheng Qin c972d65311 applications/Chat/.gitignore (#4279)
1 year ago
RichardoLuo 709e121cd5 [NFC] polish applications/Chat/coati/models/generation.py code style (#4275)
1 year ago
Yuanchen dc1b6127f9 [NFC] polish applications/Chat/inference/server.py code style (#4274)
1 year ago
アマデウス caa4433072 [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273)
1 year ago
Xu Kai 1ce997daaf [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271)
1 year ago
dayellow a50d39a143 [NFC] fix: format (#4270)
1 year ago
Wenhao Chen fee553288b [NFC] polish runtime_preparation_pass style (#4266)
1 year ago
YeAnbang 3883db452c [NFC] polish unary_elementwise_generator.py code style (#4267)
1 year ago
shenggan 798cb72907 [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260)
1 year ago
Zheng Zangwei (Alex Zheng) b2debdc09b [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259)
1 year ago
梁爽 abe4f971e0 [NFC] polish colossalai/booster/plugin/low_level_zero_plugin.py code style (#4256)
1 year ago
Yanjia0 c614a99d28 [NFC] polish colossalai/auto_parallel/offload/amp_optimizer.py code style (#4255)
1 year ago
ocd_with_naming 85774f0c1f [NFC] polish colossalai/cli/benchmark/utils.py code style (#4254)
1 year ago
CZYCW dee1c96344 [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250)
1 year ago
Junming Wu 77c469e1ba [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248)
1 year ago
Camille Zhong 915ed8bed1 [NFC] polish applications/Chat/inference/requirements.txt code style (#4265)
1 year ago
Michelle 86cf6aed5b Fix/format (#4261)
1 year ago
Jianghai b366f1d99f [NFC] Fix format for mixed precision (#4253)
1 year ago
Hongxin Liu 02192a632e
[ci] support testmon core pkg change detection (#4305)
1 year ago
Baizhou Zhang c6f6005990
[checkpointio] Sharded Optimizer Checkpoint for Gemini Plugin (#4302)
1 year ago
Hongxin Liu fc5cef2c79
[lazy] support init on cuda (#4269)
1 year ago
Cuiqing Li 4b977541a8
[Kernels] added triton-implemented of self attention for colossal-ai (#4241)
1 year ago
binmakeswell 7ff11b5537
[example] add llama pretraining (#4257)
1 year ago
Jianghai 9a4842c571
revise shardformer readme (#4246)
1 year ago
github-actions[bot] 4e9b09c222
Automated submodule synchronization (#4217)
1 year ago
Frank Lee c1cf752021
[docker] fixed ninja build command (#4203)
1 year ago