ColossalAI

Commit Graph

Author	SHA1	Message	Date
Hongxin Liu	554aa9592e	[legacy] move communication and nn to legacy and refactor logger (#4671 ) * [legacy] move communication to legacy (#4640) * [legacy] refactor logger and clean up legacy codes (#4654) * [legacy] make logger independent to gpc * [legacy] make optim independent to registry * [legacy] move test engine to legacy * [legacy] move nn to legacy (#4656) * [legacy] move nn to legacy * [checkpointio] fix save hf config * [test] remove useledd rpc pp test * [legacy] fix nn init * [example] skip tutorial hybriad parallel example * [devops] test doc check * [devops] test doc check	2023-09-11 16:24:28 +08:00
Hongxin Liu	ac178ca5c1	[legacy] move builder and registry to legacy (#4603 )	2023-09-05 21:53:10 +08:00
YeAnbang	3883db452c	[NFC] polish unary_elementwise_generator.py code style (#4267 ) Co-authored-by: aye42 <aye42@gatech.edu>	2023-07-26 14:12:57 +08:00
Frank Lee	c4b1b65931	[test] fixed tests failed due to dtensor change (#4082 ) * [test] fixed tests failed due to dtensor change * polish code	2023-07-04 16:05:01 +08:00
digger yu	e2d81eba0d	[nfc] fix typo colossalai/ applications/ (#3831 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc. * fix typo colossalai/auto_parallel autochunk fx/passes etc. * fix typo docs/ * change placememt_policy to placement_policy in docs/ and examples/ * fix typo colossalai/ applications/	2023-05-25 16:19:41 +08:00
digger yu	7f8203af69	fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808 )	2023-05-24 09:01:50 +08:00
digger yu	9265f2d4d7	[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779 ) * fix typo colossalai/autochunk auto_parallel amp * fix typo colossalai/auto_parallel nn utils etc.	2023-05-23 15:28:20 +08:00
digger yu	1baeb39c72	[NFC] fix typo with colossalai/auto_parallel/tensor_shard (#3742 ) * fix typo applications/ and colossalai/ date 5.11 * fix typo colossalai/	2023-05-17 11:13:23 +08:00
YuliangLiu0306	ffcdbf0f65	[autoparallel]integrate auto parallel feature with new tracer (#3408 ) * [autoparallel] integrate new analyzer in module level * unify the profiling method * polish * fix no codegen bug * fix pass bug * fix liveness test * polish	2023-04-04 17:40:45 +08:00
YuliangLiu0306	fee2af8610	[autoparallel] adapt autoparallel with new analyzer (#3261 ) * [autoparallel] adapt autoparallel with new analyzer * fix all node handler tests * polish * polish	2023-03-30 17:47:24 +08:00
YuliangLiu0306	47fb214b3b	[hotfix] add shard dim to aviod backward communication error (#2954 )	2023-03-01 11:41:53 +08:00
YuliangLiu0306	1dc003c169	[autoparallel] distinguish different parallel strategies (#2699 )	2023-02-15 22:28:28 +08:00
YuliangLiu0306	21d6a48f4d	[autoparallel] add shard option (#2696 ) * [autoparallel] add shard option * polish	2023-02-15 13:48:28 +08:00
Boyuan Yao	0385b26ebf	[autoparallel] Patch meta information of `torch.nn.LayerNorm` (#2647 ) * [autoparallel] layernorm metainfo patch * [autoparallel] polish test	2023-02-10 14:29:24 +08:00
YuliangLiu0306	37df666f38	[autoparallel] refactor handlers which reshape input tensors (#2615 ) * [autoparallel] refactor handlers which reshape input tensors * polish	2023-02-08 15:02:49 +08:00
YuliangLiu0306	cb3d1bef62	[autoparallel] adapt autoparallel tests with latest api (#2626 )	2023-02-08 15:02:12 +08:00
Boyuan Yao	90a9fdd91d	[autoparallel] Patch meta information of `torch.matmul` (#2584 ) * [autoparallel] matmul metainfo * [auto_parallel] remove unused print * [tests] skip test_matmul_handler when torch version is lower than 1.12.0	2023-02-08 11:05:31 +08:00
YuliangLiu0306	aa0f6686f9	[autoparallel] accelerate gpt2 training (#2495 )	2023-01-29 11:13:15 +08:00
YuliangLiu0306	8221fd7485	[autoparallel] update binary elementwise handler (#2451 ) * [autoparallel] update binary elementwise handler * polish	2023-01-12 09:35:10 +08:00
YuliangLiu0306	41429b9b28	[autoparallel] add shard option (#2423 )	2023-01-11 13:40:33 +08:00
Zirui Zhu	1c29b173c9	[NFC] polish colossalai/auto_parallel/tensor_shard/node_handler/getitem_handler.py code style (#2289 )	2023-01-04 15:09:57 +08:00
Boyuan Yao	d45695d94e	Merge pull request #2258 from hpcaitech/debug/ckpt-autoparallel [autockpt] provide option for activation checkpoint search in SPMD solver	2023-01-04 11:37:28 +08:00
Boyuan Yao	b904748210	[autoparallel] bypass MetaInfo when unavailable and modify BCAST_FUNC_OP metainfo (#2293 ) * [autoparallel] align the data_ptr with the old version of auto activation checkpoint pipeline * [autoparallel] using fwd_time and bwd_time instead of fwd_flop and bwd_flop * [autoparallel] specifycomm nodes' memory cost in construct chain * [autoparallel] fix wrong runtime apply calculation * [autoparallel] fix wrong runtime apply calculation * [autoparallel] fix wrong runtime apply calculation * [autoparallel] bypass metainfo when available and modify BCAST_FUNC_OP	2023-01-03 20:28:01 +08:00
YuliangLiu0306	fb87322773	[autoparallel] fix spelling error (#2270 )	2023-01-03 16:13:00 +08:00
Super Daniel	3ccf58aa76	[autockpt] make it work. (#2257 )	2023-01-02 23:37:45 +08:00
Boyuan Yao	ab38aebace	[autoparallel] Hook all meta information on ResNet nodes for auto activation checkpoint (#2248 ) * [autoparallel] hook node meta on graph nodes for checkpoint solver * [autoparallel] polish code * [autoparallel] restore some node handlers * colossalai/auto_parallel/passes/meta_info_prop.py * [autoparallel] remove some unused import * [autoparallel] hook bwd_mem_out	2023-01-02 16:25:18 +08:00
Boyuan Yao	d0bc5a1b34	[autoparallel] new metainfoprop based on metainfo class (#2179 ) * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] new metainfoprop to combine SPMD solver and checkpoint solver * [autoparallel] modify placeholder handler * [autoparallel] modify metainfoprop * [autoparallel] fix function typo * [autoparallel] fix placeholder handler	2022-12-28 13:35:08 +08:00
YuliangLiu0306	78509124d3	[autoparallel] update getitem handler (#2207 )	2022-12-27 19:58:32 +08:00
YuliangLiu0306	4851f2d607	[autoparallel] update_getattr_handler (#2193 )	2022-12-26 21:57:39 +08:00
YuliangLiu0306	1cce6e36ca	[autoparallel] use metainfo in handler (#2149 )	2022-12-20 10:31:22 +08:00
YuliangLiu0306	a3c6924deb	[autoparallel] process size nodes in runtime pass (#2130 ) * [autoparallel] process size nodes in runtime pass * polish code	2022-12-14 16:10:50 +08:00
YuliangLiu0306	536560ccc0	[autoparallel] implement softmax handler (#2132 )	2022-12-14 16:09:53 +08:00
YuliangLiu0306	d3d4630495	[autoparallel] add sum handler (#2101 )	2022-12-08 17:02:54 +08:00
YuliangLiu0306	3af7e65dea	[autoparallel] complete gpt related module search (#2097 )	2022-12-08 10:04:09 +08:00
YuliangLiu0306	7f72eb0510	[autoparallel]add embedding handler (#2089 ) * [autoparallel] add embedding handler * fix bugs	2022-12-07 09:41:46 +08:00
YuliangLiu0306	0e9db368ef	[autoparallel] add tensor constructor handler (#2082 )	2022-12-06 10:20:10 +08:00
YuliangLiu0306	cdf537a648	[autoparallel] add non_split linear strategy (#2078 ) * [autoparallel] add non_split linear stategy * polish	2022-12-06 10:19:33 +08:00
YuliangLiu0306	f123476666	[autoparallel] complete gpt block searching (#2065 ) * [autoparallel] complete gpt block searching * fix test	2022-12-06 10:17:10 +08:00
YuliangLiu0306	0dbcd4a6f5	[autoparallel] add split handler (#2032 ) * [autoparallel] add split handler * add numerical test and runtime passes	2022-11-29 11:03:51 +08:00
YuliangLiu0306	81330b0352	[autoparallel] add experimental permute handler (#2029 )	2022-11-27 20:26:52 +08:00
YuliangLiu0306	ea0f6b8df9	[autoparallel] add runtime pass and numerical test for view handler (#2018 )	2022-11-25 15:50:16 +08:00
YuliangLiu0306	1438993113	[autoparallel] add experimental view handler (#2011 ) * [autoparallel] add experimental view handler * polish * polish * polish code * rename variables	2022-11-24 11:34:41 +08:00
YuliangLiu0306	155891113e	[autoparallel] use pytree map style to process data (#1989 )	2022-11-21 10:44:22 +08:00
YuliangLiu0306	35e6b9ec82	[autoparallel] adapt handlers with attention block (#1990 ) * [autoparallel] adapt handlers with attention block * polish	2022-11-21 10:44:11 +08:00
YuliangLiu0306	05020e50d0	[autoparallel] support more flexible data type (#1967 )	2022-11-18 17:01:06 +08:00
YuliangLiu0306	0da1d00399	[autoparallel] support distributed dataloader option (#1906 ) * [autoparallel] support distributed dataloader option * update output handler to support ddp dataloader * poish code	2022-11-17 20:11:53 +08:00
YuliangLiu0306	fea3cb661c	[autoparallel] support addmm in tracer and solver (#1961 ) * [fx] patch addmm * [autoparallel] support addmm in tracer and solver	2022-11-16 14:59:18 +08:00
YuliangLiu0306	36c0f3ea5b	[autoparallel] remove redundancy comm node (#1893 )	2022-11-15 10:53:41 +08:00
YuliangLiu0306	1b494ad73c	[autoparallel] fix linear logical convert issue (#1857 )	2022-11-10 17:19:22 +08:00
YuliangLiu0306	49216d7ab1	[autoparallel] fix bugs caused by negative dim key (#1808 ) * [autoparallel] fix bugs caused by negative dim key * fix import error * fix matmul test issue * fix unit test issue	2022-11-08 17:03:50 +08:00

1 2

68 Commits (50e5602c2d6c8e25ad544cbecc38649e5257e7b8)