190 Commits (fcf776ff1b4c72e5f9afc98e744fcb9dd05804f6)

Author SHA1 Message Date
linsj20 fcf776ff1b
[Feature] LoRA rebased to main branch (#5622) 7 months ago
linsj20 52a2dded36
[Feature] qlora support (#5586) 7 months ago
github-actions[bot] a41cf88e9b
[format] applied code formatting on changed files in pull request 4908 (#4918) 1 year ago
Zian(Andy) Zheng 7768afbad0 Update flash_attention_patch.py 1 year ago
Camille Zhong 652adc2215 Update README.md 1 year ago
Camille Zhong afe10a85fd Update README.md 1 year ago
Camille Zhong 3043d5d676 Update modelscope link in README.md 1 year ago
Tong Li ed06731e00
update Colossal (#4832) 1 year ago
binmakeswell 822051d888
[doc] update slack link (#4823) 1 year ago
Yuanchen 1fa8c5e09f
Update Qwen-7B results (#4821) 1 year ago
flybird11111 be400a0936
[chat] fix gemini strategy (#4698) 1 year ago
Chandler-Bing b6cf0aca55
[hotfix] change llama2 Colossal-LLaMA-2 script filename (#4800) 1 year ago
Tong Li 8cbce6184d update 1 year ago
Tong Li bd014673b0 update readme 1 year ago
binmakeswell d512a4d38d
[doc] add llama2 domain-specific solution news (#4789) 1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 1 year ago
Tong Li 74aa7d964a
initial commit: add colossal llama 2 (#4784) 1 year ago
Wenhao Chen 901ab1eedd
[chat]: add lora merge weights config (#4766) 1 year ago
Wenhao Chen 7b9b86441f
[chat]: update rm, add wandb and fix bugs (#4471) 1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
digger yu e4fc57c3de
Optimized some syntax errors in the documentation and code under applications/ (#4127) 1 year ago
Ying Liu c648dc093f fix colossalai version in coati examples 1 year ago
yingliu-hpc 1467e3b41b
[coati] add chatglm model (#4539) 1 year ago
Michelle 285fe7ba71
[chat] update config and prompt (#4139) 1 year ago
Hongxin Liu 26e29d58f0
[devops] add large-scale distributed test marker (#4452) 1 year ago
Wenhao Chen 6d41c3f2aa
[doc] update Coati README (#4405) 1 year ago
Wenhao Chen da4f7b855f
[chat] fix bugs and add unit tests (#4213) 1 year ago
Wenhao Chen 75c5389037
[chat] fix compute_approx_kl (#4338) 1 year ago
Yuanchen 5187c96b7c
support session-based training (#4313) 1 year ago
yuxuan-lou 0991405361 [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277) 1 year ago
Zirui Zhu 9e512938f6 [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278) 1 year ago
Ziheng Qin c972d65311 applications/Chat/.gitignore (#4279) 1 year ago
RichardoLuo 709e121cd5 [NFC] polish applications/Chat/coati/models/generation.py code style (#4275) 1 year ago
Yuanchen dc1b6127f9 [NFC] polish applications/Chat/inference/server.py code style (#4274) 1 year ago
アマデウス caa4433072 [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273) 1 year ago
Xu Kai 1ce997daaf [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271) 1 year ago
shenggan 798cb72907 [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260) 1 year ago
Zheng Zangwei (Alex Zheng) b2debdc09b [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259) 1 year ago
CZYCW dee1c96344 [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250) 1 year ago
Junming Wu 77c469e1ba [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248) 1 year ago
Camille Zhong 915ed8bed1 [NFC] polish applications/Chat/inference/requirements.txt code style (#4265) 1 year ago
Frank Lee f447ca1811 [chat] removed cache file (#4155) 1 year ago
wukong1992 c1c672d0f0 [shardformer] shardformer support t5 model (#3994) 1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117) 1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094) 1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080) 1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083) 1 year ago
Michelle e89b127d8e
[chat]: fix chat evaluation possible bug (#4064) 1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987) 1 year ago
digger yu 727c4598a9
[nfc] fix dim not defined and fix typo (#3991) 1 year ago