ColossalAI

Commit Graph

Author	SHA1	Message	Date
ZijianYY	63cc77173b	[example] Palm adding gemini, still has bugs (#2221 )	2 years ago
HELSON	7010e18134	[example] update gpt example (#2225 )	2 years ago
Jiarui Fang	49c601da21	[example] add benchmark.sh for gpt (#2226 )	2 years ago
HELSON	3629e611cd	[example] update gpt benchmark (#2219 )	2 years ago
Jiarui Fang	54de05da5d	[builder] polish builder with better base class (#2216 ) * [builder] polish builder * remove print	2 years ago
YuliangLiu0306	3b1b91eaf4	[autoparallel] record parameter attribute in colotracer (#2217 ) * [autoparallel] record parameter attribute in collotracer * [autoparallel] fix construct_meta_info bug	2 years ago
ZijianYY	92de90dfb3	[examples] replace einsum with matmul (#2210 )	2 years ago
Jiarui Fang	7675792100	[builder] raise Error when CUDA_HOME is not set (#2213 )	2 years ago
HELSON	78a89d9b41	[diffusion] update readme (#2214 )	2 years ago
Jiarui Fang	d96cc37e32	[example] update GPT example benchmark results (#2212 )	2 years ago
Jiarui Fang	d5e3e3ec01	[example] update gpt example for larger model scale (#2211 )	2 years ago
Boyuan Yao	24246f7aa5	[autoparallel] Attach input, buffer and output tensor to MetaInfo class (#2162 ) * [fx] metainfo class for auto parallel * [fx] add unit test for linear metainfo * [fx] fix bwd param for linear * [fx] modify unit test * [fx] modify unit test * [fx] modify import * [fx] modify import * [fx] modify import * [fx] move meta profiler to auto parallel * [fx] add conv metainfo class * [fx] restore profiler * [fx] restore meta profiler * [autoparallel] modify unit test * [fx] modify unit test * [autoparallel] add batchnorm metainfo class * [autoparallel] fix batchnorm unit test function declaration * [fx] restore profiler * [fx] add relu metainfo class * [fx] restore profiler * [autoparallel] modify metainfo input * [autoparallel] add pooling metainfo * [autoparallel] add F.linear metainfo generator * [autoparallel] add binary elementwise metainfo * [fx] recover profiler * [autoparallel] fix forward memory calculation * [autoparallel] modify constants.py * [autoparallel] remove redundant print * [autoparallel] add F.conv metainfo * [autoparallel] linear fix * [autoparallel] memory estimation for communication actions * [autoparallel] fix docstring * [autoparallel] fix variables name * [autoparallel] attach tensor to metainfo class * [autoparallel] fix dangerous try except * [autoparallel] attach memory cost to shape consistency node * [autoparallel] attach shape consistency node's metainfo to the node * [autoparallel] remove todo in shape consistency memory estimation * [autoparallel] fix the annotation	2 years ago
Boyuan Yao	d0bc5a1b34	[autoparallel] new metainfoprop based on metainfo class (#2179 ) * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] modify placeholder handler * [autoparallel] modify metainfoprop * [autoparallel] fix function typo * [autoparallel] fix placeholder handler	2 years ago
YuliangLiu0306	78509124d3	[autoparallel] update getitem handler (#2207 )	2 years ago
Jiarui Fang	29868a9ec1	[example] update gpt readme with performance (#2206 )	2 years ago
Jiarui Fang	1cb532ffec	[builder] multihead attn runtime building (#2203 ) * [hotfix] correcnt cpu_optim runtime compilation * [builder] multihead attn * fix bug * fix a bug	2 years ago
Tongping Liu	8e22c38b89	[hotfix] Fixing the bug related to ipv6 support Co-authored-by: ByteDance <tongping.liu@bytedance.com>	2 years ago
ziyuhuang123	ac85a18043	[example] polish doc (#2201 )	2 years ago
YuliangLiu0306	4851f2d607	[autoparallel] update_getattr_handler (#2193 )	2 years ago
YuliangLiu0306	f10ce01e31	[autoparallel] add gpt2 performance test code (#2194 )	2 years ago
HELSON	a3100bd50d	[testing] add beit model for unit testings (#2196 ) * [testing] add beit model * [beit] fix bugs * [beit] fix bugs * [testing] fix bugs	2 years ago
Jiarui Fang	5682e6d346	[hotfix] correcnt cpu_optim runtime compilation (#2197 )	2 years ago
BlueRum	6642cebdbe	[example] Change some training settings for diffusion (#2195 )	2 years ago
HELSON	2458659919	[zero] fix error for BEiT models (#2169 ) * [zero] fix error for BEiT models * [ColoParameter] add unpack operation for tuple arguments * fix bugs * fix chunkv2 unit testing * add assertion for gradient state	2 years ago
ziyuhuang123	4363ff3e41	'[NFC] fix some typos' (#2175 )	2 years ago
binmakeswell	04a200573c	[NFC] update news link (#2191 )	2 years ago
Jiarui Fang	355ffb386e	[builder] unified cpu_optim fused_optim inferface (#2190 )	2 years ago
Jiarui Fang	9587b080ba	[builder] use runtime builder for fused_optim (#2189 )	2 years ago
Fazzie-Maqianli	ce3c4eca7b	[example] support Dreamblooth (#2188 )	2 years ago
BlueRum	1cf6d92d7c	[exmaple] diffuser, support quant inference for stable diffusion (#2186 )	2 years ago
Jiarui Fang	bc0e271e71	[buider] use builder() for cpu adam and fused optim in setup.py (#2187 )	2 years ago
Jiarui Fang	d42afd30f8	[builder] runtime adam and fused_optim builder (#2184 )	2 years ago
YuliangLiu0306	550f8f8905	[autoparallel] integrate_gpt_related_tests (#2134 ) * [autoparallel] integrate_gpt_related_tests * polish code * polish code * add GPT2Model into runtime test	2 years ago
Ziyue Jiang	59e343328d	[Pipeline Middleware ] Fix deadlock when num_microbatch=num_stage (#2156 ) * add splitter * polish code * remove comment * fix async nan by moving to cpu first Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>	2 years ago
github-actions[bot]	937f404253	Automated submodule synchronization (#2136 ) Co-authored-by: github-actions <github-actions@github.com>	2 years ago
Jiarui Fang	65f56f49e8	[example] gpt demo more accuracy tflops (#2178 )	2 years ago
Tongping Liu	ab54fed292	[hotfix] add kwargs for colo_addmm (#2171 )	2 years ago
Arsmart1	a110933d65	[NFC] fix a typo 'stable-diffusion-typo-fine-tune' Co-authored-by: ziyuhuang123 <202476410@qq.com>	2 years ago
Fazzie-Maqianli	9396a18361	Merge pull request #2174 from ziyuhuang123/main 'diffusion-typo-change'	2 years ago
ziyuhuang123	cf5028363c	'diffusion-typo-change'	2 years ago
アマデウス	622f863291	[hotfix] Jit type hint #2161 (#2164 )	2 years ago
Jiarui Fang	27327a4c90	[example] add palm pytorch version (#2172 )	2 years ago
Zihao	12e7bcd720	register meta func for rnn (#2159 )	2 years ago
Boyuan Yao	cfe2a9bd90	[autoparallel] memory estimation for shape consistency (#2144 ) * [fx] metainfo class for auto parallel * [fx] add unit test for linear metainfo * [fx] fix bwd param for linear * [fx] modify unit test * [fx] modify unit test * [fx] modify import * [fx] modify import * [fx] modify import * [fx] move meta profiler to auto parallel * [fx] add conv metainfo class * [fx] restore profiler * [fx] restore meta profiler * [autoparallel] modify unit test * [fx] modify unit test * [autoparallel] add batchnorm metainfo class * [autoparallel] fix batchnorm unit test function declaration * [fx] restore profiler * [fx] add relu metainfo class * [fx] restore profiler * [autoparallel] modify metainfo input * [autoparallel] add pooling metainfo * [autoparallel] add F.linear metainfo generator * [autoparallel] add binary elementwise metainfo * [fx] recover profiler * [autoparallel] fix forward memory calculation * [autoparallel] modify constants.py * [autoparallel] remove redundant print * [autoparallel] add F.conv metainfo * [autoparallel] linear fix * [autoparallel] memory estimation for communication actions * [autoparallel] fix docstring * [autoparallel] fix variables name	2 years ago
Jiarui Fang	b87496a66b	[hotfix] fix auto policy of test_sharded_optim_v2 (#2157 )	2 years ago
YuliangLiu0306	16335cb537	[hotfix] fix aten default bug (#2158 )	2 years ago
Jiarui Fang	a4b4bb01d6	[example] update vit readme (#2155 )	2 years ago
Jiarui Fang	2cfe685b9f	[exmaple] add vit missing functions (#2154 )	2 years ago
HELSON	a7d95b7024	[example] add zero1, zero2 example in GPT examples (#2146 ) * [example] add zero1 and zero2 for GPT * update readme in gpt example * polish code * change init value * update readme	2 years ago
YuliangLiu0306	1cce6e36ca	[autoparallel] use metainfo in handler (#2149 )	2 years ago

1534 Commits (63cc77173ba68f81983ba265eb03774afa52a0b7)