Commit Graph

1990 Commits (648183a96037a0d9e758154f98e1e1b8004eea0b)

Author SHA1 Message Date
Jiarui Fang 7675792100
[builder] raise Error when CUDA_HOME is not set (#2213) 2022-12-28 16:07:08 +08:00
HELSON 78a89d9b41
[diffusion] update readme (#2214) 2022-12-28 16:06:48 +08:00
Jiarui Fang d96cc37e32
[example] update GPT example benchmark results (#2212) 2022-12-28 14:28:12 +08:00
Jiarui Fang d5e3e3ec01
[example] update gpt example for larger model scale (#2211) 2022-12-28 13:54:08 +08:00
Boyuan Yao 24246f7aa5
[autoparallel] Attach input, buffer and output tensor to MetaInfo class (#2162)
* [fx] metainfo class for auto parallel

* [fx] add unit test for linear metainfo

* [fx] fix bwd param for linear

* [fx] modify unit test

* [fx] modify unit test

* [fx] modify import

* [fx] modify import

* [fx] modify import

* [fx] move meta profiler to auto parallel

* [fx] add conv metainfo class

* [fx] restore profiler

* [fx] restore meta profiler

* [autoparallel] modify unit test

* [fx] modify unit test

* [autoparallel] add batchnorm metainfo class

* [autoparallel] fix batchnorm unit test function declaration

* [fx] restore profiler

* [fx] add relu metainfo class

* [fx] restore profiler

* [autoparallel] modify metainfo input

* [autoparallel] add pooling metainfo

* [autoparallel] add F.linear metainfo generator

* [autoparallel] add binary elementwise metainfo

* [fx] recover profiler

* [autoparallel] fix forward memory calculation

* [autoparallel] modify constants.py

* [autoparallel] remove redundant print

* [autoparallel] add F.conv metainfo

* [autoparallel] linear fix

* [autoparallel] memory estimation for communication actions

* [autoparallel] fix docstring

* [autoparallel] fix variables name

* [autoparallel] attach tensor to metainfo class

* [autoparallel] fix dangerous try except

* [autoparallel] attach memory cost to shape consistency node

* [autoparallel] attach shape consistency node's metainfo to the node

* [autoparallel] remove todo in shape consistency memory estimation

* [autoparallel] fix the annotation
2022-12-28 13:37:40 +08:00
Boyuan Yao d0bc5a1b34
[autoparallel] new metainfoprop based on metainfo class (#2179)
* [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver

* [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver

* [autoparallel] modify placeholder handler

* [autoparallel] modify metainfoprop

* [autoparallel] fix function typo

* [autoparallel] fix placeholder handler
2022-12-28 13:35:08 +08:00
YuliangLiu0306 78509124d3
[autoparallel] update getitem handler (#2207) 2022-12-27 19:58:32 +08:00
Jiarui Fang 29868a9ec1
[example] update gpt readme with performance (#2206) 2022-12-27 17:39:53 +08:00
Jiarui Fang 1cb532ffec
[builder] multihead attn runtime building (#2203)
* [hotfix] correct cpu_optim runtime compilation

* [builder] multihead attn

* fix bug

* fix a bug
2022-12-27 16:06:09 +08:00
oahzxl cb2dd1a106 turn off print mem 2022-12-27 15:01:58 +08:00
oahzxl a2b4755ce9 code style 2022-12-27 14:49:52 +08:00
oahzxl 6be89a3b82 add chunksize in emit, fix bug in reassign shape 2022-12-27 14:48:25 +08:00
Tongping Liu 8e22c38b89
[hotfix] Fix the bug related to IPv6 support
Co-authored-by: ByteDance <tongping.liu@bytedance.com>
2022-12-27 12:42:46 +08:00
ziyuhuang123 ac85a18043
[example] polish doc (#2201) 2022-12-27 10:04:01 +08:00
oahzxl 378a49dc6c code style 2022-12-27 09:48:59 +08:00
oahzxl 8f5a0edfab add chunk select 2022-12-26 23:08:49 +08:00
YuliangLiu0306 4851f2d607
[autoparallel] update_getattr_handler (#2193) 2022-12-26 21:57:39 +08:00
YuliangLiu0306 f10ce01e31
[autoparallel] add gpt2 performance test code (#2194) 2022-12-26 21:56:58 +08:00
HELSON a3100bd50d
[testing] add beit model for unit testings (#2196)
* [testing] add beit model

* [beit] fix bugs

* [beit] fix bugs

* [testing] fix bugs
2022-12-26 17:35:36 +08:00
Jiarui Fang 5682e6d346
[hotfix] correct cpu_optim runtime compilation (#2197) 2022-12-26 16:45:14 +08:00
oahzxl 1b8a066592 add chunk select class 2022-12-26 15:28:01 +08:00
BlueRum 6642cebdbe
[example] Change some training settings for diffusion (#2195) 2022-12-26 15:22:20 +08:00
HELSON 2458659919
[zero] fix error for BEiT models (#2169)
* [zero] fix error for BEiT models

* [ColoParameter] add unpack operation for tuple arguments

* fix bugs

* fix chunkv2 unit testing

* add assertion for gradient state
2022-12-26 15:03:54 +08:00
ziyuhuang123 4363ff3e41
'[NFC] fix some typos' (#2175) 2022-12-25 18:41:39 +08:00
binmakeswell 04a200573c
[NFC] update news link (#2191) 2022-12-24 11:53:52 +08:00
Jiarui Fang 355ffb386e
[builder] unified cpu_optim fused_optim interface (#2190) 2022-12-23 20:57:41 +08:00
oahzxl 786a398a6b code style 2022-12-23 17:42:51 +08:00
oahzxl 9b1b890347 update run 2022-12-23 17:32:11 +08:00
oahzxl 51ef8384c1 finish node reorder 2022-12-23 17:25:36 +08:00
Jiarui Fang 9587b080ba
[builder] use runtime builder for fused_optim (#2189) 2022-12-23 17:07:03 +08:00
oahzxl 884a228ea6 reorder nodes 2022-12-23 17:06:07 +08:00
Fazzie-Maqianli ce3c4eca7b
[example] support DreamBooth (#2188) 2022-12-23 16:47:30 +08:00
BlueRum 1cf6d92d7c
[example] diffuser, support quant inference for stable diffusion (#2186) 2022-12-23 16:06:29 +08:00
Jiarui Fang bc0e271e71
[builder] use builder() for cpu adam and fused optim in setup.py (#2187) 2022-12-23 16:05:13 +08:00
oahzxl e0ae68e736 code style 2022-12-23 15:49:04 +08:00
oahzxl fa5e6fbf96 code style 2022-12-23 15:38:37 +08:00
oahzxl 4f5e105af3 remove flow tracer 2022-12-23 15:34:41 +08:00
oahzxl 4d89525fc2 remove abandoned function 2022-12-23 14:28:49 +08:00
oahzxl 49ba619085 code style 2022-12-23 14:26:43 +08:00
oahzxl d309e9338b adapt codegen to prepose node 2022-12-23 14:26:12 +08:00
Jiarui Fang d42afd30f8
[builder] runtime adam and fused_optim builder (#2184) 2022-12-23 14:14:21 +08:00
oahzxl 522f017418 code style 2022-12-23 13:41:51 +08:00
oahzxl 774d34f1aa refactor flow search 2022-12-23 13:41:10 +08:00
YuliangLiu0306 550f8f8905
[autoparallel] integrate_gpt_related_tests (#2134)
* [autoparallel] integrate_gpt_related_tests

* polish code

* polish code

* add GPT2Model into runtime test
2022-12-23 12:36:59 +08:00
Ziyue Jiang 59e343328d
[Pipeline Middleware] Fix deadlock when num_microbatch=num_stage (#2156)
* add splitter

* polish code

* remove comment

* fix async nan by moving to cpu first

Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2022-12-23 11:38:43 +08:00
github-actions[bot] 937f404253
Automated submodule synchronization (#2136)
Co-authored-by: github-actions <github-actions@github.com>
2022-12-23 09:34:48 +08:00
Jiarui Fang 65f56f49e8
[example] gpt demo more accurate tflops (#2178) 2022-12-22 20:51:35 +08:00
Tongping Liu ab54fed292
[hotfix] add kwargs for colo_addmm (#2171) 2022-12-22 13:25:30 +08:00
Arsmart1 a110933d65
[NFC] fix a typo 'stable-diffusion-typo-fine-tune'
Co-authored-by: ziyuhuang123 <202476410@qq.com>
2022-12-22 10:57:05 +08:00
Fazzie-Maqianli 9396a18361
Merge pull request #2174 from ziyuhuang123/main
'diffusion-typo-change'
2022-12-22 10:37:55 +08:00