ColossalAI

Commit Graph

Author	SHA1	Message	Date
Baizhou Zhang	c1535ccbba	[doc] fix docs about booster api usage (#3898 )	2023-06-06 13:36:11 +08:00
Hongxin Liu	ec9bbc0094	[devops] improving testmon cache (#3902 ) * [devops] improving testmon cache * [devops] fix branch name with slash * [devops] fix branch name with slash * [devops] fix edit action * [devops] fix edit action * [devops] fix edit action * [devops] fix edit action * [devops] fix edit action * [devops] fix edit action * [devops] update readme	2023-06-06 11:32:31 +08:00
Yuanchen	57a6d7685c	support evaluation for english (#3880 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-06-05 21:24:21 +08:00
digger yu	1878749753	[nfc] fix typo colossalai/nn (#3887 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/ * fix typo colossalai/ applications/ * fix typo colossalai/cli fx kernel * fix typo colossalai/nn * revert change warmuped	2023-06-05 16:04:27 +08:00
Hongxin Liu	ae02d4e4f7	[bf16] add bf16 support (#3882 ) * [bf16] add bf16 support for fused adam (#3844) * [bf16] fused adam kernel support bf16 * [test] update fused adam kernel test * [test] update fused adam test * [bf16] cpu adam and hybrid adam optimizers support bf16 (#3860) * [bf16] implement mixed precision mixin and add bf16 support for low level zero (#3869) * [bf16] add mixed precision mixin * [bf16] low level zero optim support bf16 * [text] update low level zero test * [text] fix low level zero grad acc test * [bf16] add bf16 support for gemini (#3872) * [bf16] gemini support bf16 * [test] update gemini bf16 test * [doc] update gemini docstring * [bf16] add bf16 support for plugins (#3877) * [bf16] add bf16 support for legacy zero (#3879) * [zero] init context support bf16 * [zero] legacy zero support bf16 * [test] add zero bf16 test * [doc] add bf16 related docstring for legacy zero	2023-06-05 15:58:31 +08:00
jiangmingyan	07cb21142f	[doc]update moe chinese document. (#3890 ) * [doc]update-moe * [doc]update-moe * [doc]update-moe * [doc]update-moe * [doc]update-moe	2023-06-05 15:57:54 +08:00
Liu Ziming	8065cc5fba	Modify torch version requirement to adapt torch 2.0 (#3896 )	2023-06-05 15:57:35 +08:00
Hongxin Liu	dbb32692d2	[lazy] refactor lazy init (#3891 ) * [lazy] remove old lazy init * [lazy] refactor lazy init folder structure * [lazy] fix lazy tensor deepcopy * [test] update lazy init test	2023-06-05 14:20:47 +08:00
Maruyama_Aya	25447d4407	modify path	2023-06-05 11:47:07 +08:00
Maruyama_Aya	42e3232bc0	roll back	2023-06-02 17:00:57 +08:00
Maruyama_Aya	60ec33bb18	Add a new example of Dreambooth training using the booster API	2023-06-02 16:50:51 +08:00
digger yu	70c8cdecf4	[nfc] fix typo colossalai/cli fx kernel (#3847 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/ * fix typo colossalai/ applications/ * fix typo colossalai/cli fx kernel	2023-06-02 15:02:45 +08:00
Maruyama_Aya	46503c35dd	Modify torch version requirement to adapt torch 2.0	2023-06-01 14:30:51 +08:00
jiangmingyan	281b33f362	[doc] update document of zero with chunk. (#3855 ) * [doc] fix title of mixed precision * [doc]update document of zero with chunk * [doc] update document of zero with chunk, fix * [doc] update document of zero with chunk, fix * [doc] update document of zero with chunk, fix * [doc] update document of zero with chunk, add doc test * [doc] update document of zero with chunk, add doc test * [doc] update document of zero with chunk, fix installation * [doc] update document of zero with chunk, fix zero with chunk doc * [doc] update document of zero with chunk, fix zero with chunk doc	2023-05-30 18:41:56 +08:00
jiangmingyan	5f79008c4a	[example] update gemini examples (#3868 ) * [example]update gemini examples * [example]update gemini examples	2023-05-30 18:41:41 +08:00
Yuanchen	2506e275b8	[evaluation] improvement on evaluation (#3862 ) * fix a bug when the config file contains one category but the answer file doesn't contains that category * fix Chinese prompt file * support gpt-3.5-turbo and gpt-4 evaluation * polish and update README * resolve pr comments --------- Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-05-30 11:48:41 +08:00
jiangmingyan	b0474878bf	[doc] update nvme offload documents. (#3850 )	2023-05-26 01:22:01 +08:00
Frank Lee	ae959a72a5	[workflow] fixed workflow check for docker build (#3849 )	2023-05-25 16:42:34 +08:00
Frank Lee	d42b1be09d	[release] bump to v0.3.0 (#3830 )	2023-05-25 16:20:07 +08:00
digger yu	e2d81eba0d	[nfc] fix typo colossalai/ applications/ (#3831 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/ * fix typo colossalai/ applications/	2023-05-25 16:19:41 +08:00
jiangmingyan	a64df3fa97	[doc] update document of gemini instruction. (#3842 ) * [doc] update meet_gemini.md * [doc] update meet_gemini.md * [doc] fix parentheses * [doc] fix parentheses * [doc] fix doc test * [doc] fix doc test * [doc] fix doc	2023-05-25 14:58:01 +08:00
Frank Lee	54e97ed7ea	[workflow] supported test on CUDA 10.2 (#3841 )	2023-05-25 14:14:34 +08:00
wukong1992	3229f93e30	[booster] add warning for torch fsdp plugin doc (#3833 )	2023-05-25 14:00:02 +08:00
Hongxin Liu	7c9f2ed6dd	[dtensor] polish sharding spec docstring (#3838 ) * [dtensor] polish sharding spec docstring * [dtensor] polish sharding spec example docstring	2023-05-25 13:09:42 +08:00
Frank Lee	84500b7799	[workflow] fixed testmon cache in build CI (#3806 ) * [workflow] fixed testmon cache in build CI * polish code	2023-05-24 14:59:40 +08:00
digger yu	518b31c059	[docs] change placememt_policy to placement_policy (#3829 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/	2023-05-24 14:51:49 +08:00
digger yu	e90fdb1000	fix typo docs/	2023-05-24 13:57:43 +08:00
Yuanchen	34966378e8	[evaluation] add automatic evaluation pipeline (#3821 ) * add functions for gpt evaluation * add automatic eval Update eval.py * using jload and modify the type of answers1 and answers2 * Update eval.py Update eval.py * Update evaluator.py * support gpt evaluation * update readme.md update README.md update READNE.md modify readme.md * add Chinese example for config, battle prompt and evaluation prompt file * remove GPT-4 config * remove sample folder --------- Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com> Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>	2023-05-24 11:18:23 +08:00
Frank Lee	05b8a8de58	[workflow] changed to doc build to be on schedule and release (#3825 ) * [workflow] changed to doc build to be on schedule and release * polish code	2023-05-24 10:50:19 +08:00
Yanming W	269150b6f4	[Docker] Fix a couple of build issues (#3691 )	2023-05-24 10:22:51 +08:00
digger yu	7f8203af69	fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808 )	2023-05-24 09:01:50 +08:00
jiangmingyan	725365f297	Merge pull request #3810 from jiangmingyan/amp [doc] update amp document	2023-05-23 18:58:16 +08:00
jiangmingyan	278fcbc444	[doc]fix	2023-05-23 17:53:11 +08:00
jiangmingyan	8aa1fb2c7f	[doc]fix	2023-05-23 17:50:30 +08:00
Frank Lee	1e3b64f26c	[workflow] enblaed doc build from a forked repo (#3815 )	2023-05-23 17:49:53 +08:00
Hongxin Liu	19d153057e	[doc] add warning about fsdp plugin (#3813 )	2023-05-23 17:16:10 +08:00
wukong1992	6b305a99d6	[booster] torch fsdp fix ckpt (#3788 )	2023-05-23 16:58:45 +08:00
jiangmingyan	c425a69d52	[doc] add removed change of config.py	2023-05-23 16:42:36 +08:00
jiangmingyan	75272ef37b	[doc] add removed warning	2023-05-23 16:34:30 +08:00
Mingyan Jiang	a520610bd9	[doc] update amp document	2023-05-23 16:20:29 +08:00
Mingyan Jiang	1167bf5b10	[doc] update amp document	2023-05-23 16:20:17 +08:00
Mingyan Jiang	8c62e50dbb	[doc] update amp document	2023-05-23 16:20:01 +08:00
digger yu	9265f2d4d7	[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc.	2023-05-23 15:28:20 +08:00
jiangmingyan	e871e342b3	[API] add docstrings and initialization to apex amp, naive amp (#3783 ) * [mixed_precison] add naive amp demo * [mixed_precison] add naive amp demo * [api] add docstrings and initialization to apex amp, naive amp * [api] add docstring to apex amp/ naive amp * [api] add docstring to apex amp/ naive amp * [api] add docstring to apex amp/ naive amp * [api] add docstring to apex amp/ naive amp * [api] add docstring to apex amp/ naive amp * [api] add docstring to apex amp/ naive amp * [api] fix * [api] fix	2023-05-23 15:17:24 +08:00
Frank Lee	615e2e5fc1	[test] fixed lazy init test import error (#3799 )	2023-05-23 11:57:15 +08:00
Frank Lee	ad93c736ea	[workflow] enable testing for develop & feature branch (#3801 )	2023-05-23 11:21:15 +08:00
jiangmingyan	ef02d7ef6d	[doc] update gradient accumulation (#3771 ) * [doc]update gradient accumulation * [doc]update gradient accumulation * [doc]update gradient accumulation * [doc]update gradient accumulation * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, add sidebars * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, fix * [doc]update gradient accumulation, resolve comments * [doc]update gradient accumulation, resolve comments * fix	2023-05-23 10:52:30 +08:00
Frank Lee	f5c425c898	fixed the example docstring for booster (#3795 )	2023-05-22 18:10:06 +08:00
Frank Lee	788e07dbc5	[workflow] fixed the docker build workflow (#3794 ) * [workflow] fixed the docker build workflow * polish code	2023-05-22 16:30:32 +08:00
liuzeming	4d29c0f8e0	Fix/docker action (#3266 ) * [docker] Add ARG VERSION to determine the Tag * [workflow] fixed the version in the release docker workflow --------- Co-authored-by: liuzeming <liuzeming@4paradigm.com>	2023-05-22 15:04:00 +08:00

1 2 3 4 5 ...

2497 Commits (c1c672d0f0fcba484f294ad8550df59ee5448fdd) All Branches Search

2497 Commits (c1c672d0f0fcba484f294ad8550df59ee5448fdd)

All Branches