ColossalAI

Commit Graph

Author	SHA1	Message	Date
YeAnbang	ed97d3a5d3	[Chat] fix readme (#5989 ) * fix readme * fix readme, tokenization fully tested * fix readme, tokenization fully tested * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: root <root@notebook-8f919155-6035-47b4-9c6f-1be133b9e2c9-0.notebook-8f919155-6035-47b4-9c6f-1be133b9e2c9.colossal-ai.svc.cluster.local> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2024-08-12 14:55:17 +08:00
YeAnbang	0b2d55c4ab	Support overall loss, update KTO logging	2024-08-02 06:51:38 +00:00
Tong Li	1aeb5e8847	[hotfix] Remove unused plan section (#5957 ) * remove readme * fix readme * update	2024-07-31 17:47:46 +08:00
YeAnbang	66fbf2ecb7	Update README.md (#5958 )	2024-07-31 17:44:09 +08:00
YeAnbang	30f4e31a33	[Chat] Fix lora (#5946 ) * fix merging * remove filepath * fix style	2024-07-31 14:10:17 +08:00
YeAnbang	150505cbb8	Merge branch 'kto' of https://github.com/hpcaitech/ColossalAI into kto	2024-07-19 10:11:05 +00:00
YeAnbang	d49550fb49	refactor tokenization	2024-07-19 10:10:48 +00:00
Tong Li	d08c99be0d	Merge branch 'main' into kto	2024-07-19 15:23:31 +08:00
Tong Li	f585d4e38e	[ColossalChat] Hotfix for ColossalChat (#5910 ) * add ignore and tiny llama * fix path issue * run style * fix issue * update bash * add ignore and tiny llama * fix path issue * run style * fix issue * update bash * fix ddp issue * add Qwen 1.5 32B	2024-07-19 13:40:07 +08:00
YeAnbang	544b7a38a1	fix style, add kto data sample	2024-07-18 08:38:56 +00:00
YeAnbang	09d5ffca1a	add kto	2024-07-18 07:54:11 +00:00
pre-commit-ci[bot]	8a9721bafe	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2024-07-10 10:44:32 +00:00
YeAnbang	d888c3787c	add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint	2024-07-10 10:17:08 +00:00
YeAnbang	c8d1b4a968	add orpo	2024-06-27 07:20:28 +00:00
YeAnbang	82aecd6374	add SimPO	2024-06-24 02:12:20 +00:00
YeAnbang	2abdede1d7	fix readme	2024-06-10 01:08:42 +00:00
YeAnbang	0d7ff10ea5	replace the customized dataloader setup with the build-in one	2024-06-07 09:43:42 +00:00
YeAnbang	0b4a33548c	moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy	2024-06-07 07:01:31 +00:00
YeAnbang	7a7e86987d	upgrade colossal-chat support tp_group>1, add sp for sft	2024-06-07 07:01:30 +00:00
YeAnbang	df5e9c53cf	[ColossalChat] Update RLHF V2 (#5286 ) * Add dpo. Fix sft, ppo, lora. Refactor all * fix and tested ppo * 2 nd round refactor * add ci tests * fix ci * fix ci * fix readme, style * fix readme style * fix style, fix benchmark * reproduce benchmark result, remove useless files * rename to ColossalChat * use new image * fix ci workflow * fix ci * use local model/tokenizer for ci tests * fix ci * fix ci * fix ci * fix ci timeout * fix rm progress bar. fix ci timeout * fix ci * fix ci typo * remove 3d plugin from ci temporary * test environment * cannot save optimizer * support chat template * fix readme * fix path * test ci locally * restore build_or_pr * fix ci data path * fix benchmark * fix ci, move ci tests to 3080, disable fast tokenizer * move ci to 85 * support flash attention 2 * add all-in-one data preparation script. Fix colossal-llama2-chat chat template * add hardware requirements * move ci test data * fix save_model, add unwrap * fix missing bos * fix missing bos; support grad accumulation with gemini * fix ci * fix ci * fix ci * fix llama2 chat template config * debug sft * debug sft * fix colossalai version requirement * fix ci * add sanity check to prevent NaN loss * fix requirements * add dummy data generation script * add dummy data generation script * add dummy data generation script * add dummy data generation script * update readme * update readme * update readme and ignore * fix logger bug * support parallel_output * modify data preparation logic * fix tokenization * update lr * fix inference * run pre-commit --------- Co-authored-by: Tong Li <tong.li352711588@gmail.com>	2024-03-29 14:12:29 +08:00

20 Commits (dafda0fb7082506ad76b5deff3024b3d5dbb904b)