ColossalAI

Commit Graph

Author	SHA1	Message	Date
Xuanlei Zhao	54b197cc02	update readme	11 months ago
Xuanlei Zhao	4922641098	script	11 months ago
Xuanlei Zhao	d660a41850	update	11 months ago
Xuanlei Zhao	b8fadb68a7	add pad	11 months ago
Xuanlei Zhao	23341687ed	update	11 months ago
Xuanlei Zhao	aa2e091dc6	update	11 months ago
Xuanlei Zhao	7c5b1a585f	update	12 months ago
Xuanlei Zhao	ebd8cc579a	update script	12 months ago
Xuanlei Zhao	f66469e209	update	12 months ago
Xuanlei Zhao	8aef2dba02	init	12 months ago
Yuanchen	cefdc32615	[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169 ) * Support GSM, Data Leakage Evaluation and Tensor Parallel * remove redundant code and update inference.py in examples/gpt_evaluation --------- Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>	12 months ago
Michelle	b07a6f4e27	[colossalqa] fix pangu api (#5170 ) * fix pangu api * add comment	12 months ago
Yuanchen	b397104438	[Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878 ) * Add finetuning Colossal-Llama-2 example * Add finetuning Colossal-Llama-2 example 2 * Add finetuning Colossal-Llama-2 example and support NEFTuning * Add inference example and refine neftune * Modify readme file * update the imports --------- Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>	12 months ago
Michelle	368b5e3d64	[doc] fix colossalqa document (#5146 ) * fix doc * modify doc	1 year ago
Michelle	c7fd9a5213	[ColossalQA] refactor server and webui & add new feature (#5138 ) * refactor server and webui & add new feature * add requirements * modify readme and ui	1 year ago
github-actions[bot]	f6731db67c	[format] applied code formatting on changed files in pull request 5115 (#5118 ) Co-authored-by: github-actions <github-actions@github.com>	1 year ago
digger yu	9110406a47	fix typo change JOSNL TO JSONL etc. (#5116 )	1 year ago
Zian(Andy) Zheng	7b789f4dd2	[FEATURE] Add Safety Eval Datasets to ColossalEval (#5095 ) * add safetybench and cvalues(responsibility) eval dataset * Modify code according to review suggestions --------- Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>	1 year ago
digger yu	d5661f0f25	[nfc] fix typo change directoty to directory (#5111 )	1 year ago
YeAnbang	e53e729d8e	[Feature] Add document retrieval QA (#5020 ) * add langchain * add langchain * Add files via upload * add langchain * fix style * fix style: remove extra space * add pytest; modified retriever * add pytest; modified retriever * add tests to build_on_pr.yml * fix build_on_pr.yml * fix build on pr; fix environ vars * seperate unit tests for colossalqa from build from pr * fix container setting; fix environ vars * commented dev code * add incremental update * remove stale code * fix style * change to sha3 224 * fix retriever; fix style; add unit test for document loader * fix ci workflow config * fix ci workflow config * add set cuda visible device script in ci * fix doc string * fix style; update readme; refactored * add force log info * change build on pr, ignore colossalqa * fix docstring, captitalize all initial letters * fix indexing; fix text-splitter * remove debug code, update reference * reset previous commit * update LICENSE update README add key-value mode, fix bugs * add files back * revert force push * remove junk file * add test files * fix retriever bug, add intent classification * change conversation chain design * rewrite prompt and conversation chain * add ui v1 * ui v1 * fix atavar * add header * Refactor the RAG Code and support Pangu * Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo. * resolved conversation. tested scripts under examples. web demo still buggy * fix ci tests * Some modifications to add ChatGPT api * modify llm.py and remove unnecessary files * Delete applications/ColossalQA/examples/ui/test_frontend_input.json * Remove OpenAI api key * add colossalqa * move files * move files * move files * move files * fix style * Add Readme and fix some bugs. * Add something to readme and modify some code * modify a directory name for clarity * remove redundant directory * Correct a type in llm.py * fix AI prefix * fix test_memory.py * fix conversation * fix some erros and typos * Fix a missing import in RAG_ChatBot.py * add colossalcloud LLM wrapper, correct issues in code review --------- Co-authored-by: YeAnbang <anbangy2@outlook.com> Co-authored-by: Orion-Zheng <zheng_zian@u.nus.edu> Co-authored-by: Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com> Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>	1 year ago
Orion-Zheng	43ad0d9ef0	fix wrong EOS token in ColossalChat	1 year ago
Yuanchen	239cd92eff	Support mtbench (#5025 ) Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>	1 year ago
Yuanchen	abe071b663	fix ColossalEval (#4992 ) Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>	1 year ago
github-actions[bot]	a41cf88e9b	[format] applied code formatting on changed files in pull request 4908 (#4918 ) Co-authored-by: github-actions <github-actions@github.com>	1 year ago
Zian(Andy) Zheng	7768afbad0	Update flash_attention_patch.py To be compatible with the new change in the Transformers library, where a new argument 'padding_mask' was added to forward function of attention layer. https://github.com/huggingface/transformers/pull/25598	1 year ago
Camille Zhong	652adc2215	Update README.md	1 year ago
Camille Zhong	afe10a85fd	Update README.md	1 year ago
Camille Zhong	3043d5d676	Update modelscope link in README.md add modelscope link	1 year ago
Tong Li	ed06731e00	update Colossal (#4832 )	1 year ago
binmakeswell	822051d888	[doc] update slack link (#4823 )	1 year ago
Yuanchen	1fa8c5e09f	Update Qwen-7B results (#4821 ) Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>	1 year ago
flybird11111	be400a0936	[chat] fix gemini strategy (#4698 ) * [chat] fix gemini strategy * [chat] fix gemini strategy * [chat] fix gemini strategy * [chat] fix gemini strategy * g# This is a combination of 2 commits. [chat] fix gemini strategy fox * [chat] fix gemini strategy update llama2 example [chat] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * [fix] fix gemini strategy * fix * fix * fix * fix * fix * Update train_prompts.py	1 year ago
Chandler-Bing	b6cf0aca55	[hotfix] change llama2 Colossal-LLaMA-2 script filename (#4800 ) change filename: pretraining.py -> trainin.py there is no file named pretraing.py. wrong writing	1 year ago
Tong Li	8cbce6184d	update	1 year ago
Tong Li	bd014673b0	update readme	1 year ago
binmakeswell	d512a4d38d	[doc] add llama2 domain-specific solution news (#4789 ) * [doc] add llama2 domain-specific solution news	1 year ago
Yuanchen	ce777853ae	[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786 ) * Add ColossalEval * Delete evaluate in Chat --------- Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by: Tong Li <tong.li352711588@gmail.com>	1 year ago
Tong Li	74aa7d964a	initial commit: add colossal llama 2 (#4784 )	1 year ago
Wenhao Chen	901ab1eedd	[chat]: add lora merge weights config (#4766 ) * feat: modify lora merge weights fn * feat: add lora merge weights config	1 year ago
Wenhao Chen	7b9b86441f	[chat]: update rm, add wandb and fix bugs (#4471 ) * feat: modify forward fn of critic and reward model * feat: modify calc_action_log_probs * to: add wandb in sft and rm trainer * feat: update train_sft * feat: update train_rm * style: modify type annotation and add warning * feat: pass tokenizer to ppo trainer * to: modify trainer base and maker base * feat: add wandb in ppo trainer * feat: pass tokenizer to generate * test: update generate fn tests * test: update train tests * fix: remove action_mask * feat: remove unused code * fix: fix wrong ignore_index * fix: fix mock tokenizer * chore: update requirements * revert: modify make_experience * fix: fix inference * fix: add padding side * style: modify _on_learn_batch_end * test: use mock tokenizer * fix: use bf16 to avoid overflow * fix: fix workflow * [chat] fix gemini strategy * [chat] fix * sync: update colossalai strategy * fix: fix args and model dtype * fix: fix checkpoint test * fix: fix requirements * fix: fix missing import and wrong arg * fix: temporarily skip gemini test in stage 3 * style: apply pre-commit * fix: temporarily skip gemini test in stage 1&2 --------- Co-authored-by: Mingyan Jiang <1829166702@qq.com>	1 year ago
Hongxin Liu	079bf3cb26	[misc] update pre-commit and run all files (#4752 ) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format	1 year ago
digger yu	e4fc57c3de	Optimized some syntax errors in the documentation and code under applications/ (#4127 ) Co-authored-by: flybird11111 <1829166702@qq.com>	1 year ago
Hongxin Liu	a39a5c66fe	Merge branch 'main' into feature/shardformer	1 year ago
Ying Liu	c648dc093f	fix colossalai version in coati examples	1 year ago
yingliu-hpc	1467e3b41b	[coati] add chatglm model (#4539 ) * update configuration of chatglm and add support in coati * add unit test & update chatglm default config & fix bos index issue * remove chatglm due to oom * add dataset pkg in requirement-text * fix parameter issue in test_models * add ref in tokenize & rm unnessary parts * separate source & target tokenization in chatglm * add unit test to chatglm * fix test dataset issue * update truncation of chatglm * fix Colossalai version * fix colossal ai version in test	1 year ago
Michelle	285fe7ba71	[chat] update config and prompt (#4139 ) * update config and prompt * update config --------- Co-authored-by: Qianran Ma <qianranm@luchentech.com>	1 year ago
Hongxin Liu	26e29d58f0	[devops] add large-scale distributed test marker (#4452 ) * [test] remove cpu marker * [test] remove gpu marker * [test] update pytest markers * [ci] update unit test ci	1 year ago
Wenhao Chen	6d41c3f2aa	[doc] update Coati README (#4405 ) * style: apply formatter * fix: add outdated warnings * docs: add dataset format and polish * docs: polish README * fix: fix json format * fix: fix typos * revert: revert 7b example	1 year ago
Wenhao Chen	da4f7b855f	[chat] fix bugs and add unit tests (#4213 ) * style: rename replay buffer Experience replay is typically for off policy algorithms. Use this name in PPO maybe misleading. * fix: fix wrong zero2 default arg * test: update experience tests * style: rename zero_pad fn * fix: defer init in CycledDataLoader * test: add benchmark test * style: rename internal fn of generation * style: rename internal fn of lora * fix: remove unused loss fn * fix: remove unused utils fn * refactor: remove generate_with_actor fn * fix: fix type annotation * test: add models tests * fix: skip llama due to long execution time * style: modify dataset * style: apply formatter * perf: update reward dataset * fix: fix wrong IGNORE_INDEX in sft dataset * fix: remove DataCollatorForSupervisedDataset * test: add dataset tests * style: apply formatter * style: rename test_ci to test_train * feat: add llama in inference * test: add inference tests * test: change test scripts directory * fix: update ci * fix: fix typo * fix: skip llama due to oom * fix: fix file mod * style: apply formatter * refactor: remove duplicated llama_gptq * style: apply formatter * to: update rm test * feat: add tokenizer arg * feat: add download model script * test: update train tests * fix: modify gemini load and save pretrained * test: update checkpoint io test * to: modify nproc_per_node * fix: do not remove existing dir * fix: modify save path * test: add random choice * fix: fix sft path * fix: enlarge nproc_per_node to avoid oom * fix: add num_retry * fix: make lora config of rm and critic consistent * fix: add warning about lora weights * fix: skip some gpt2 tests * fix: remove grad ckpt in rm and critic due to errors * refactor: directly use Actor in train_sft * test: add more arguments * fix: disable grad ckpt when using lora * fix: fix save_pretrained and related tests * test: enable zero2 tests * revert: remove useless fn * style: polish code * test: modify test args	1 year ago
Wenhao Chen	75c5389037	[chat] fix compute_approx_kl (#4338 )	1 year ago

1 2 3 4 5

211 Commits (54b197cc02f2b2a78e30897689ae56258a5271a7)