ColossalAI

Commit Graph

Author	SHA1	Message	Date
digger yu	385e85afd4	[hotfix] fix typo s/keywrods/keywords etc. (#5429 )	9 months ago
Youngon	68f55a709c	[hotfix] fix stable diffusion inference bug. (#5289 ) * Update train_ddp.yaml delete "strategy" to fix DDP config loading bug in "main.py" * Update train_ddp.yaml fix inference with scripts/txt2img.py config file load bug. * Update README.md add pretrain model test code.	9 months ago
MickeyCHAN	e304e4db35	[hotfix] fix sd vit import error (#5420 ) * fix import error * Update dpt_depth.py --------- Co-authored-by: binmakeswell <binmakeswell@gmail.com>	9 months ago
Hongxin Liu	070df689e6	[devops] fix extention building (#5427 )	9 months ago
Hongxin Liu	d202cc28c0	[npu] change device to accelerator api (#5239 ) * update accelerator * fix timer * fix amp * update * fix * update bug * add error raise * fix autocast * fix set device * remove doc accelerator * update doc * update doc * update doc * use nullcontext * update cpu * update null context * change time limit for example * udpate * update * update * update * [npu] polish accelerator code --------- Co-authored-by: Xuanlei Zhao <xuanlei.zhao@gmail.com> Co-authored-by: zxl <43881818+oahzxl@users.noreply.github.com>	11 months ago
binmakeswell	822051d888	[doc] update slack link (#4823 )	1 year ago
Baizhou Zhang	df66741f77	[bug] fix get_default_parser in examples (#4764 )	1 year ago
Hongxin Liu	079bf3cb26	[misc] update pre-commit and run all files (#4752 ) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format	1 year ago
Hongxin Liu	b5f9e37c70	[legacy] clean up legacy code (#4743 ) * [legacy] remove outdated codes of pipeline (#4692) * [legacy] remove cli of benchmark and update optim (#4690) * [legacy] remove cli of benchmark and update optim * [doc] fix cli doc test * [legacy] fix engine clip grad norm * [legacy] remove outdated colo tensor (#4694) * [legacy] remove outdated colo tensor * [test] fix test import * [legacy] move outdated zero to legacy (#4696) * [legacy] clean up utils (#4700) * [legacy] clean up utils * [example] update examples * [legacy] clean up amp * [legacy] fix amp module * [legacy] clean up gpc (#4742) * [legacy] clean up context * [legacy] clean core, constants and global vars * [legacy] refactor initialize * [example] fix examples ci * [example] fix examples ci * [legacy] fix tests * [example] fix gpt example * [example] fix examples ci * [devops] fix ci installation * [example] fix examples ci	1 year ago
Baizhou Zhang	295b38fecf	[example] update vit example for hybrid parallel plugin (#4641 ) * update vit example for hybrid plugin * reset tp/pp size * fix dataloader iteration bug * update optimizer passing in evaluation/add grad_accum * change criterion * wrap tqdm * change grad_accum to grad_checkpoint * fix pbar	1 year ago
ChengDaqi2023	8e2e1992b8	[example] update streamlit 0.73.1 to 1.11.1 (#4386 )	1 year ago
Hongxin Liu	27061426f7	[gemini] improve compatibility and add static placement policy (#4479 ) * [gemini] remove distributed-related part from colotensor (#4379) * [gemini] remove process group dependency * [gemini] remove tp part from colo tensor * [gemini] patch inplace op * [gemini] fix param op hook and update tests * [test] remove useless tests * [test] remove useless tests * [misc] fix requirements * [test] fix model zoo * [test] fix model zoo * [test] fix model zoo * [test] fix model zoo * [test] fix model zoo * [misc] update requirements * [gemini] refactor gemini optimizer and gemini ddp (#4398) * [gemini] update optimizer interface * [gemini] renaming gemini optimizer * [gemini] refactor gemini ddp class * [example] update gemini related example * [example] update gemini related example * [plugin] fix gemini plugin args * [test] update gemini ckpt tests * [gemini] fix checkpoint io * [example] fix opt example requirements * [example] fix opt example * [example] fix opt example * [example] fix opt example * [gemini] add static placement policy (#4443) * [gemini] add static placement policy * [gemini] fix param offload * [test] update gemini tests * [plugin] update gemini plugin * [plugin] update gemini plugin docstr * [misc] fix flash attn requirement * [test] fix gemini checkpoint io test * [example] update resnet example result (#4457) * [example] update bert example result (#4458) * [doc] update gemini doc (#4468) * [example] update gemini related examples (#4473) * [example] update gpt example * [example] update dreambooth example * [example] update vit * [example] update opt * [example] update palm * [example] update vit and opt benchmark * [hotfix] fix bert in model zoo (#4480) * [hotfix] fix bert in model zoo * [test] remove chatglm gemini test * [test] remove sam gemini test * [test] remove vit gemini test * [hotfix] fix opt tutorial example (#4497) * [hotfix] fix opt tutorial example * [hotfix] fix opt tutorial example	1 year ago
caption	16c0acc01b	[hotfix] update gradio 3.11 to 3.34.0 (#4329 )	1 year ago
Jianghai	31dc302017	[examples] copy resnet example to image (#4090 ) * copy resnet example * add pytest package * skip test_ci * skip test_ci * skip test_ci	1 year ago
Baizhou Zhang	b3ab7fbabf	[example] update ViT example using booster api (#3940 )	1 year ago
Liu Ziming	e277534a18	Merge pull request #3905 from MaruyamaAya/dreambooth [example] Adding an example of training dreambooth with the new booster API	1 year ago
digger yu	33eef714db	fix typo examples and docs (#3932 )	1 year ago
Maruyama_Aya	9b5e7ce21f	modify shell for check	1 year ago
Maruyama_Aya	730a092ba2	modify shell for check	1 year ago
Maruyama_Aya	49567d56d1	modify shell for check	1 year ago
Maruyama_Aya	039854b391	modify shell for check	1 year ago
Maruyama_Aya	cf4792c975	modify shell for check	1 year ago
Maruyama_Aya	c94a33579b	modify shell for check	2 years ago
Maruyama_Aya	4fc8bc68ac	modify file path	2 years ago
Maruyama_Aya	b4437e88c3	fixed port	2 years ago
Maruyama_Aya	79c9f776a9	fixed port	2 years ago
Maruyama_Aya	d3379f0be7	fixed model saving bugs	2 years ago
Maruyama_Aya	b29e1f0722	change directory	2 years ago
digger yu	518b31c059	[docs] change placememt_policy to placement_policy (#3829 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/	2 years ago
digger-yu	b9a8dff7e5	[doc] Fix typo under colossalai and doc(#3618 ) * Fixed several spelling errors under colossalai * Fix the spelling error in colossalai and docs directory * Cautious Changed the spelling error under the example folder * Update runtime_preparation_pass.py revert autograft to autograd * Update search_chunk.py utile to until * Update check_installation.py change misteach to mismatch in line 91 * Update 1D_tensor_parallel.md revert to perceptron * Update 2D_tensor_parallel.md revert to perceptron in line 73 * Update 2p5D_tensor_parallel.md revert to perceptron in line 71 * Update 3D_tensor_parallel.md revert to perceptron in line 80 * Update README.md revert to resnet in line 42 * Update reorder_graph.py revert to indice in line 7 * Update p2p.py revert to megatron in line 94 * Update initialize.py revert to torchrun in line 198 * Update routers.py change to detailed in line 63 * Update routers.py change to detailed in line 146 * Update README.md revert random number in line 402	2 years ago
natalie_cao	de84c0311a	Polish Code	2 years ago
NatalieC323	fb8fae6f29	Revert "[dreambooth] fixing the incompatibity in requirements.txt (#3190 ) (#3378 )" (#3481 )	2 years ago
NatalieC323	c701b77b11	[dreambooth] fixing the incompatibity in requirements.txt (#3190 ) (#3378 ) * Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml * Update README.md * Update README.md * Delete requirements_colossalai.txt * Update requirements.txt * Update README.md	2 years ago
Frank Lee	80eba05b0a	[test] refactor tests with spawn (#3452 ) * [test] added spawn decorator * polish code * polish code * polish code * polish code * polish code * polish code	2 years ago
ver217	26b7aac0be	[zero] reorganize zero/gemini folder structure (#3424 ) * [zero] refactor low-level zero folder structure * [zero] fix legacy zero import path * [zero] fix legacy zero import path * [zero] remove useless import * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] fix test import path * [zero] fix test * [zero] fix circular import * [zero] update import	2 years ago
Jan Roudaut	dd367ce795	[doc] polish diffusion example (#3386 ) * [examples/images/diffusion]: README.md: typo fixes * Update README.md * Grammar fixes * Reformulated "Step 3" (xformers) introduction to the cost => at the cost + reworded pip availability.	2 years ago
Jan Roudaut	51cd2fec57	Typofix: malformed `xformers` version (#3384 ) s/0.12.0/0.0.12/	2 years ago
NatalieC323	280fcdc485	polish code (#3194 ) Co-authored-by: YuliangLiu0306 <72588413+YuliangLiu0306@users.noreply.github.com>	2 years ago
NatalieC323	e5f668f280	[dreambooth] fixing the incompatibity in requirements.txt (#3190 ) * Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml * Update README.md * Update README.md * Delete requirements_colossalai.txt * Update requirements.txt * Update README.md	2 years ago
NatalieC323	4e921cfbd6	[examples] Solving the diffusion issue of incompatibility issue#3169 (#3170 ) * Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml	2 years ago
binmakeswell	3c01280a56	[doc] add community contribution guide (#3153 ) * [doc] update contribution guide * [doc] update contribution guide * [doc] add community contribution guide	2 years ago
Fazzie-Maqianli	5d5f475d75	[diffusers] fix ci and docker (#3085 )	2 years ago
Camille Zhong	e58a3c804c	Fix the version of lightning and colossalai in Stable Diffusion environment requirement (#3073 ) 1. Modify the README of stable diffusion 2. Fix the version of pytorch lightning&lightning and colossalai version to enable codes running successfully.	2 years ago
Haofan Wang	47ecb22387	[example] add LoRA support (#2821 ) * add lora * format	2 years ago
Fazzie-Maqianli	ba84cd80b2	fix pip install colossal (#2764 )	2 years ago
Fazzie-Maqianli	d03f4429c1	add ci (#2641 )	2 years ago
Fazzie-Maqianli	292c81ed7c	fix/transformer-verison (#2581 )	2 years ago
Fazzie	cad1f50512	fix ckpt	2 years ago
Fazzie	f35326881c	fix README	2 years ago
Jiarui Fang	fd8d19a6e7	[example] update lightning dependency for stable diffusion (#2522 )	2 years ago

1 2

96 Commits (6df844b8c4946c734115b7e180b292888d857bc1)