ColossalAI

Commit Graph

Author	SHA1	Message	Date
oahzxl	e5a5fbb8a9	update source add	2022-12-31 01:00:06 +08:00
xcnick	85178a397a	[hotfix] fix error for torch 2.0 (#2243 )	2022-12-30 23:11:55 +08:00
Super Daniel	b7d0990c61	[autoparallel] fix construct meta info. (#2245 )	2022-12-30 19:56:44 +08:00
Frank Lee	89542ceb44	[doc] updated the stable diffussion on docker usage (#2244 ) * [doc] updated the stable diffussion on docker usage * polish doc	2022-12-30 18:00:20 +08:00
Jiarui Fang	50cdf5430e	[example] diffusion install from docker (#2239 ) * [builder] builder for scaled_upper_triang_masked_softmax * add missing files * fix a bug * polish code * [example] diffusion install from docker	2022-12-30 16:25:24 +08:00
Ziyue Jiang	57929a6210	fix type of num_worker_threads (#2237 ) Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>	2022-12-30 11:04:01 +08:00
Jiarui Fang	db4cbdc7fb	[builder] builder for scaled_upper_triang_masked_softmax (#2234 )	2022-12-30 09:58:00 +08:00
HELSON	31fe84237b	[example] fix benchmark.sh for gpt example (#2229 )	2022-12-29 23:00:14 +08:00
Super Daniel	78483a9fdd	[logger] hotfix, missing _FORMAT (#2231 )	2022-12-29 22:59:39 +08:00
oahzxl	f5515e9978	use max_mem to control stratge	2022-12-29 16:55:47 +08:00
oahzxl	f7d8092c84	align openfold	2022-12-29 16:01:05 +08:00
oahzxl	5c4df01af3	update openfold	2022-12-29 15:54:08 +08:00
oahzxl	289f3a45c2	init openfold	2022-12-29 15:01:15 +08:00
oahzxl	efe6fe3a33	code style	2022-12-29 14:47:47 +08:00
oahzxl	7a23deb584	code style	2022-12-29 14:47:16 +08:00
oahzxl	5a916c0adb	add print	2022-12-29 14:42:29 +08:00
oahzxl	1d7ca02301	add benchmark	2022-12-29 14:28:38 +08:00
Jiarui Fang	2cdecc9f38	[example] make palm + GeminiDPP work (#2227 )	2022-12-29 14:28:31 +08:00
ZijianYY	63cc77173b	[example] Palm adding gemini, still has bugs (#2221 )	2022-12-29 14:01:09 +08:00
HELSON	7010e18134	[example] update gpt example (#2225 )	2022-12-29 12:01:45 +08:00
Jiarui Fang	49c601da21	[example] add benchmark.sh for gpt (#2226 )	2022-12-29 12:00:00 +08:00
oahzxl	fff493c202	init openfold	2022-12-29 11:48:11 +08:00
oahzxl	69af93107f	add evoformer openfold init	2022-12-29 11:28:25 +08:00
HELSON	3629e611cd	[example] update gpt benchmark (#2219 )	2022-12-29 10:51:42 +08:00
Jiarui Fang	54de05da5d	[builder] polish builder with better base class (#2216 ) * [builder] polish builder * remove print	2022-12-28 19:45:49 +08:00
YuliangLiu0306	3b1b91eaf4	[autoparallel] record parameter attribute in colotracer (#2217 ) * [autoparallel] record parameter attribute in collotracer * [autoparallel] fix construct_meta_info bug	2022-12-28 19:29:08 +08:00
ZijianYY	92de90dfb3	[examples] replace einsum with matmul (#2210 )	2022-12-28 19:03:06 +08:00
Jiarui Fang	7675792100	[builder] raise Error when CUDA_HOME is not set (#2213 )	2022-12-28 16:07:08 +08:00
HELSON	78a89d9b41	[diffusion] update readme (#2214 )	2022-12-28 16:06:48 +08:00
Jiarui Fang	d96cc37e32	[example] update GPT example benchmark results (#2212 )	2022-12-28 14:28:12 +08:00
Jiarui Fang	d5e3e3ec01	[example] update gpt example for larger model scale (#2211 )	2022-12-28 13:54:08 +08:00
Boyuan Yao	24246f7aa5	[autoparallel] Attach input, buffer and output tensor to MetaInfo class (#2162 ) * [fx] metainfo class for auto parallel * [fx] add unit test for linear metainfo * [fx] fix bwd param for linear * [fx] modify unit test * [fx] modify unit test * [fx] modify import * [fx] modify import * [fx] modify import * [fx] move meta profiler to auto parallel * [fx] add conv metainfo class * [fx] restore profiler * [fx] restore meta profiler * [autoparallel] modify unit test * [fx] modify unit test * [autoparallel] add batchnorm metainfo class * [autoparallel] fix batchnorm unit test function declaration * [fx] restore profiler * [fx] add relu metainfo class * [fx] restore profiler * [autoparallel] modify metainfo input * [autoparallel] add pooling metainfo * [autoparallel] add F.linear metainfo generator * [autoparallel] add binary elementwise metainfo * [fx] recover profiler * [autoparallel] fix forward memory calculation * [autoparallel] modify constants.py * [autoparallel] remove redundant print * [autoparallel] add F.conv metainfo * [autoparallel] linear fix * [autoparallel] memory estimation for communication actions * [autoparallel] fix docstring * [autoparallel] fix variables name * [autoparallel] attach tensor to metainfo class * [autoparallel] fix dangerous try except * [autoparallel] attach memory cost to shape consistency node * [autoparallel] attach shape consistency node's metainfo to the node * [autoparallel] remove todo in shape consistency memory estimation * [autoparallel] fix the annotation	2022-12-28 13:37:40 +08:00
Boyuan Yao	d0bc5a1b34	[autoparallel] new metainfoprop based on metainfo class (#2179 ) * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] modify placeholder handler * [autoparallel] modify metainfoprop * [autoparallel] fix function typo * [autoparallel] fix placeholder handler	2022-12-28 13:35:08 +08:00
YuliangLiu0306	78509124d3	[autoparallel] update getitem handler (#2207 )	2022-12-27 19:58:32 +08:00
Jiarui Fang	29868a9ec1	[example] update gpt readme with performance (#2206 )	2022-12-27 17:39:53 +08:00
Jiarui Fang	1cb532ffec	[builder] multihead attn runtime building (#2203 ) * [hotfix] correcnt cpu_optim runtime compilation * [builder] multihead attn * fix bug * fix a bug	2022-12-27 16:06:09 +08:00
oahzxl	cb2dd1a106	turn off print mem	2022-12-27 15:01:58 +08:00
oahzxl	a2b4755ce9	code style	2022-12-27 14:49:52 +08:00
oahzxl	6be89a3b82	add chunksize in emit, fix bug in reassgin shape	2022-12-27 14:48:25 +08:00
Tongping Liu	8e22c38b89	[hotfix] Fixing the bug related to ipv6 support Co-authored-by: ByteDance <tongping.liu@bytedance.com>	2022-12-27 12:42:46 +08:00
ziyuhuang123	ac85a18043	[example] polish doc (#2201 )	2022-12-27 10:04:01 +08:00
oahzxl	378a49dc6c	code style	2022-12-27 09:48:59 +08:00
oahzxl	8f5a0edfab	add chunk select	2022-12-26 23:08:49 +08:00
YuliangLiu0306	4851f2d607	[autoparallel] update_getattr_handler (#2193 )	2022-12-26 21:57:39 +08:00
YuliangLiu0306	f10ce01e31	[autoparallel] add gpt2 performance test code (#2194 )	2022-12-26 21:56:58 +08:00
HELSON	a3100bd50d	[testing] add beit model for unit testings (#2196 ) * [testing] add beit model * [beit] fix bugs * [beit] fix bugs * [testing] fix bugs	2022-12-26 17:35:36 +08:00
Jiarui Fang	5682e6d346	[hotfix] correcnt cpu_optim runtime compilation (#2197 )	2022-12-26 16:45:14 +08:00
oahzxl	1b8a066592	add chunk select class	2022-12-26 15:28:01 +08:00
BlueRum	6642cebdbe	[example] Change some training settings for diffusion (#2195 )	2022-12-26 15:22:20 +08:00
HELSON	2458659919	[zero] fix error for BEiT models (#2169 ) * [zero] fix error for BEiT models * [ColoParameter] add unpack operation for tuple arguments * fix bugs * fix chunkv2 unit testing * add assertion for gradient state	2022-12-26 15:03:54 +08:00

... 3 4 5 6 7 ...

1817 Commits (867c8c2d3a90bbf55a5bedba80a3aeabe0299d0f) All Branches Search

1817 Commits (867c8c2d3a90bbf55a5bedba80a3aeabe0299d0f)

All Branches