ColossalAI

Commit Graph

Author	SHA1	Message	Date
YuliangLiu0306	35e6b9ec82	[autoparallel] adapt handlers with attention block (#1990 ) * [autoparallel] adapt handlers with attention block * polish	2022-11-21 10:44:11 +08:00
Fazzie-Maqianli	b5dbb46172	[example] add diffusion inference (#1986 )	2022-11-20 18:35:29 +08:00
binmakeswell	a01278e810	Update requirements.txt	2022-11-18 18:57:18 +08:00
YuliangLiu0306	05020e50d0	[autoparallel] support more flexible data type (#1967 )	2022-11-18 17:01:06 +08:00
Jiarui Fang	5bec3b2168	[Gemini] open grad checkpoint when model building (#1984 )	2022-11-18 16:32:54 +08:00
Boyuan Yao	c26f21d365	[autoparallel] add pooling metainfo (#1968 ) * [fx] metainfo class for auto parallel * [fx] add unit test for linear metainfo * [fx] fix bwd param for linear * [fx] modify unit test * [fx] modify unit test * [fx] modify import * [fx] modify import * [fx] modify import * [fx] move meta profiler to auto parallel * [fx] add conv metainfo class * [fx] restore profiler * [fx] restore meta profiler * [autoparallel] modify unit test * [fx] modify unit test * [autoparallel] add batchnorm metainfo class * [autoparallel] fix batchnorm unit test function declaration * [fx] restore profiler * [fx] add relu metainfo class * [fx] restore profiler * [autoparallel] modify metainfo input * [autoparallel] add pooling metainfo	2022-11-18 15:13:03 +08:00
Jiarui Fang	3712ac7f90	[Gemini] add bert for MemtracerWrapper unintests (#1982 )	2022-11-18 14:58:28 +08:00
Jiarui Fang	e481489aa6	[Gemini] MemtracerWrapper unittests (#1981 )	2022-11-18 14:19:40 +08:00
mandoxzhang	52bd106627	add RoBERTa (#1980 ) * update roberta * update roberta & readme * update roberta & readme * update roberta & readme	2022-11-18 14:04:49 +08:00
Jiarui Fang	31922110ad	[Gemini] memory trace hook (#1978 )	2022-11-18 11:52:55 +08:00
Jiarui Fang	0529fcde06	[Gemini] independent runtime tracer (#1974 )	2022-11-18 10:53:42 +08:00
YuliangLiu0306	0da1d00399	[autoparallel] support distributed dataloader option (#1906 ) * [autoparallel] support distributed dataloader option * update output handler to support ddp dataloader * poish code	2022-11-17 20:11:53 +08:00
Genghan Zhang	6630d45546	[autoparallel] Add alpha beta (#1973 ) * Add alpha beta * Fix test * Fix test	2022-11-17 16:01:14 +08:00
Jiarui Fang	cc0ed7cf33	[Gemini] ZeROHookV2 -> GeminiZeROHook (#1972 )	2022-11-17 14:43:49 +08:00
ver217	f8a7148dec	[kernel] move all symlinks of kernel to `colossalai._C` (#1971 )	2022-11-17 13:42:33 +08:00
Jiarui Fang	7e24b9b9ee	[Gemini] clean no used MemTraceOp (#1970 )	2022-11-17 13:41:54 +08:00
Boyuan Yao	7c7921f71b	[autoparallel] add torch.nn.ReLU metainfo (#1868 ) * [fx] metainfo class for auto parallel * [fx] add unit test for linear metainfo * [fx] fix bwd param for linear * [fx] modify unit test * [fx] modify unit test * [fx] modify import * [fx] modify import * [fx] modify import * [fx] move meta profiler to auto parallel * [fx] add conv metainfo class * [fx] restore profiler * [fx] restore meta profiler * [autoparallel] modify unit test * [fx] modify unit test * [autoparallel] add batchnorm metainfo class * [autoparallel] fix batchnorm unit test function declaration * [fx] restore profiler * [fx] add relu metainfo class * [fx] restore profiler * [autoparallel] modify metainfo input	2022-11-16 23:12:31 +08:00
Jiarui Fang	8c66a1d0aa	[polish] remove useless file _mem_tracer_hook.py (#1963 )	2022-11-16 15:55:10 +08:00
Jiarui Fang	c4739a725a	[Gemini] polish memstats collector (#1962 )	2022-11-16 15:45:57 +08:00
YuliangLiu0306	fea3cb661c	[autoparallel] support addmm in tracer and solver (#1961 ) * [fx] patch addmm * [autoparallel] support addmm in tracer and solver	2022-11-16 14:59:18 +08:00
Jiarui Fang	f7e276fa71	[Gemini] add GeminiAdamOptimizer (#1960 )	2022-11-16 14:44:28 +08:00
HELSON	7066dfbf82	[zero] fix memory leak for zero2 (#1955 )	2022-11-16 11:43:24 +08:00
Jiarui Fang	60abd86d6a	[example] enhance GPT demo (#1959 ) * [example] enhence GPT demo * Update README.md Co-authored-by: binmakeswell <binmakeswell@gmail.com>	2022-11-16 11:36:27 +08:00
Fazzie-Maqianli	acba142929	Merge pull request #1958 from Fazziekey/lightning update model download in README	2022-11-16 11:29:21 +08:00
Fazzie	a09f88ab07	update model download in README	2022-11-16 11:17:30 +08:00
Fazzie-Maqianli	6bdd0a90ca	update lightning version (#1954 )	2022-11-15 16:57:48 +08:00
Jiarui Fang	52c6ad26e0	[ColoTensor] reconfig ColoInitContext, decouple default_pg and default_dist_spec. (#1953 )	2022-11-15 16:24:16 +08:00
zbian	598d456d0e	fixed logger	2022-11-15 16:00:07 +08:00
zbian	6877121377	updated flash attention api	2022-11-15 15:25:39 +08:00
YuliangLiu0306	36c0f3ea5b	[autoparallel] remove redundancy comm node (#1893 )	2022-11-15 10:53:41 +08:00
binmakeswell	9183e0dec5	[tutorial] polish all README (#1946 )	2022-11-14 19:49:32 +08:00
Frank Lee	de56b563b9	[tutorial] added missing dummy dataloader (#1944 )	2022-11-14 04:09:03 -06:00
Frank Lee	c6ea65011f	[tutorial] fixed pipeline bug for sequence parallel (#1943 )	2022-11-14 04:06:57 -06:00
アマデウス	e52f9d9109	[tensorparallel] fixed tp layers (#1938 )	2022-11-14 17:34:03 +08:00
Jiarui Fang	cf68cc92ac	[example] add vit (#1942 ) * [ColoTensor] ColoInitContext initialize parameters in shard mode. * polish * [example] add vit	2022-11-14 17:28:03 +08:00
YuliangLiu0306	c7925c5d08	[sc demo] add requirements to spmd README (#1941 )	2022-11-14 17:22:45 +08:00
Boyuan Yao	d5f5e06d82	[SC] remove redundant hands on (#1939 ) * [sc] SC tutorial for auto checkpoint * [sc] polish examples * [sc] polish readme * [sc] polish readme and help information * [sc] polish readme and help information * [sc] modify auto checkpoint benchmark * [sc] remove imgs * [sc] remove redundant handson	2022-11-14 03:05:21 -06:00
Jiarui Fang	9f4fb3f28a	[ColoTensor] ColoInitContext initialize parameters in shard mode. (#1937 )	2022-11-14 16:05:09 +08:00
ver217	b42b672842	[release] update version (#1931 )	2022-11-13 15:34:08 +08:00
binmakeswell	41868f7605	[tutorial] polish README and OPT files (#1930 ) * [tutorial] polish README and OPT files * [tutorial] polish README and OPT files * [tutorial] polish README and OPT files	2022-11-13 13:09:58 +08:00
ver217	b0b7a786b7	[tutorial] add synthetic dataset for opt (#1924 )	2022-11-13 03:26:11 +08:00
Frank Lee	0486048453	[tutorial] updated hybrid parallel readme (#1928 ) * [tutorial] updated hybrid parallel readme * polish code	2022-11-13 03:25:01 +08:00
Frank Lee	807cbdb87d	[tutorial] added synthetic data for sequence parallel (#1927 ) * [tutorial] added synthetic data for sequence parallel * polish code	2022-11-13 03:24:02 +08:00
Frank Lee	abf4c27f6a	[tutorial] removed huggingface model warning (#1925 )	2022-11-12 23:12:18 +08:00
Frank Lee	d43a671ad6	Hotfix/tutorial readme index (#1922 ) * [tutorial] removed tutorial index in readme * [tutorial] removed tutorial index in readme	2022-11-12 18:24:52 +08:00
Boyuan Yao	24cbee0ebe	[tutorial] modify hands-on of auto activation checkpoint (#1920 ) * [sc] SC tutorial for auto checkpoint * [sc] polish examples * [sc] polish readme * [sc] polish readme and help information * [sc] polish readme and help information * [sc] modify auto checkpoint benchmark * [sc] remove imgs	2022-11-12 18:21:03 +08:00
Frank Lee	ff16773ded	[tutorial] added synthetic data for hybrid parallel (#1921 ) * [tutorial] added synthetic data for hybrid parallel * polish code	2022-11-12 18:18:55 +08:00
Frank Lee	3c42fdbedc	[tutorial] added synthetic data for hybrid parallel (#1919 )	2022-11-12 17:49:48 +08:00
Frank Lee	1b0dd05940	[tutorial] added synthetic dataset for auto parallel demo (#1918 )	2022-11-12 17:14:32 +08:00
Frank Lee	acd9abc5ca	[tutorial] updated auto parallel demo with latest data path (#1917 )	2022-11-12 16:55:19 +08:00

1 2 3 4 5 ...

1370 Commits (35e6b9ec8297ec8ae72d2ddc53c5fdcc8768e748) All Branches Search

1370 Commits (35e6b9ec8297ec8ae72d2ddc53c5fdcc8768e748)

All Branches