ColossalAI

Commit Graph

Author	SHA1	Message	Date
Frank Lee	f8a0e7fb01	Merge pull request #412 from hpcaitech/develop merge develop to main	2022-03-14 22:48:56 +08:00
Jiarui Fang	21dc54e019	[zero] memtracer to record cuda memory usage of model data and overall system (#395 )	2022-03-14 22:05:30 +08:00
Jiarui Fang	a37bf1bc42	[hotfix] rm test_tensor_detector.py (#413 )	2022-03-14 21:39:48 +08:00
Jiarui Fang	370f567e7d	[zero] new interface for ShardedOptimv2 (#406 )	2022-03-14 20:48:41 +08:00
LuGY	a9c27be42e	Added tensor detector (#393 ) * Added tensor detector * Added the - states * Allowed change include_cpu when detect()	2022-03-14 18:01:46 +08:00
Frank Lee	32296cf462	Merge pull request #409 from 1SAA/develop [hotfix] fixed error when no collective communication in CommProfiler	2022-03-14 17:43:45 +08:00
1SAA	907ac4a2dc	fixed error when no collective communication in CommProfiler	2022-03-14 17:21:00 +08:00
Frank Lee	62b08acc72	update hf badge link (#410 )	2022-03-14 17:07:01 +08:00
Frank Lee	2fe68b359a	Merge pull request #403 from ver217/feature/shard-strategy [zero] Add bucket tensor shard strategy	2022-03-14 16:29:28 +08:00
Frank Lee	cf92a779dc	added huggingface badge (#407 )	2022-03-14 16:23:02 +08:00
HELSON	dfd0363f68	polished output format for communication profiler and pcie profiler (#404 ) fixed typing error	2022-03-14 16:07:45 +08:00
ver217	63469c0f91	polish code	2022-03-14 15:48:55 +08:00
ver217	54fd37f0e0	polish unit test	2022-03-14 15:06:02 +08:00
ver217	88804aee49	add bucket tensor shard strategy	2022-03-14 14:48:32 +08:00
Frank Lee	aaead33cfe	Merge pull request #397 from hpcaitech/create-pull-request/patch-sync-submodule [Bot] Synchronize Submodule References	2022-03-14 10:11:06 +08:00
github-actions	6098bc4cce	Automated submodule synchronization	2022-03-14 00:01:12 +00:00
Frank Lee	6937f85004	Merge pull request #402 from oikosohn/oikosohn-patch-1 fix typo in CHANGE_LOG.md	2022-03-13 22:40:04 +08:00
sohn	ff4f5d7231	fix typo in CHANGE_LOG.md - fix typo, `Unifed` -> `Unified` below Added	2022-03-13 23:34:34 +09:00
Frank Lee	fc5101f24c	Merge pull request #401 from hpcaitech/develop	2022-03-13 11:09:17 +08:00
Frank Lee	fc2fd0abe5	Merge pull request #400 from hpcaitech/hotfix/readme fixed broken badge link	2022-03-13 09:12:59 +08:00
Frank Lee	6d3a4f51bf	fixed broken badge link	2022-03-13 09:11:48 +08:00
HELSON	7c079d9c33	[hotfix] fixed bugs in ShardStrategy and PcieProfiler (#394 )	2022-03-11 18:12:46 +08:00
Frank Lee	1e4bf85cdb	fixed bug in activation checkpointing test (#387 )	2022-03-11 15:50:28 +08:00
Jiarui Fang	3af13a2c3e	[zero] polish ShardedOptimV2 unittest (#385 ) * place params on cpu after zero init context * polish code * bucketzed cpu gpu tensor transter * find a bug in sharded optim unittest * add offload unittest for ShardedOptimV2. * polish code and make it more robust	2022-03-11 15:50:28 +08:00
binmakeswell	ce7b2c9ae3	update README and images path (#384 )	2022-03-11 15:50:28 +08:00
ScalableEKNN	2fcd4f38ee	fix format (#379 )	2022-03-11 15:50:28 +08:00
Jiang Zhuo	5a4a3b77d9	fix format (#376 )	2022-03-11 15:50:28 +08:00
lucasliunju	ce886a9062	fix format (#374 )	2022-03-11 15:50:28 +08:00
Frank Lee	526a318032	[unit test] Refactored test cases with component func (#339 ) * refactored test with component func * fixed bug	2022-03-11 15:50:28 +08:00
LuGY	de46450461	Added activation offload (#331 ) * Added activation offload * Fixed the import bug, used the pytest	2022-03-11 15:50:28 +08:00
Jiarui Fang	272ebfb57d	[bug] shard param during initializing the ShardedModelV2 (#381 )	2022-03-11 15:50:28 +08:00
HELSON	8c18eb0998	[profiler] Fixed bugs in CommProfiler and PcieProfiler (#377 )	2022-03-11 15:50:28 +08:00
Jiarui Fang	b5f43acee3	[zero] find miss code (#378 )	2022-03-11 15:50:28 +08:00
Jiarui Fang	6b6002962a	[zero] zero init context collect numel of model (#375 )	2022-03-11 15:50:28 +08:00
HELSON	1ed7c24c02	Added PCIE profiler to dectect data transmission (#373 )	2022-03-11 15:50:28 +08:00
jiaruifang	d9217e1960	Revert "[zero] bucketized tensor cpu gpu copy (#368 )" This reverts commit `bef05489b6`.	2022-03-11 15:50:28 +08:00
Xue Fuzhao	a8cd5e8e81	Update README-zh-Hans.md (#367 ) Fuzhao updated	2022-03-11 15:50:28 +08:00
Shen Chenhui	1c88dd43e2	Fix/format (#366 )	2022-03-11 15:50:28 +08:00
Ziheng Qin	0db43fa995	fix format (#364 )	2022-03-11 15:50:28 +08:00
RichardoLuo	8539898ec6	flake8 style change (#363 )	2022-03-11 15:50:28 +08:00
Kai Wang (Victor Kai)	53bb3bcc0a	fix format (#362 )	2022-03-11 15:50:28 +08:00
ziyu huang	a77d73f22b	fix format parallel_context.py (#359 ) Co-authored-by: huangziyu <202476410arsmart@gmail.com>	2022-03-11 15:50:28 +08:00
Zangwei	c695369af0	fix format constants.py (#358 )	2022-03-11 15:50:28 +08:00
Yuer867	4a0f8c2c50	fix format parallel_2p5d (#357 )	2022-03-11 15:50:28 +08:00
Liang Bowen	7eb87f516d	flake8 style (#352 )	2022-03-11 15:50:28 +08:00
Xu Kai	54ee8d1254	Fix/format colossalai/engine/paramhooks/(#350 )	2022-03-11 15:50:28 +08:00
Maruyama_Aya	e83970e3dc	fix format ColossalAI\colossalai\context\process_group_initializer	2022-03-11 15:50:28 +08:00
yuxuan-lou	3b88eb2259	Flake8 code restyle	2022-03-11 15:50:28 +08:00
xyupeng	af801cb4df	fix format setup.py (#343 )	2022-03-11 15:50:28 +08:00
xuqifan897	148207048e	Qifan formated file ColossalAI\colossalai\nn\layer\parallel_1d\layers.py (#342 )	2022-03-11 15:50:28 +08:00


2755 Commits (50e5602c2d6c8e25ad544cbecc38649e5257e7b8)
