Commit Graph

1373 Commits (d00d905b8601b4e163d81f5d71c4254f462f847e)
 

Author SHA1 Message Date
binmakeswell d00d905b86
[NFC] polish license (#1999)
2 years ago
Jiarui Fang a2d3266648
[hotfix] make Gemini work for conv DNN (#1998)
2 years ago
YuliangLiu0306 155891113e
[autoparallel] use pytree map style to process data (#1989)
2 years ago
YuliangLiu0306 35e6b9ec82
[autoparallel] adapt handlers with attention block (#1990)
2 years ago
Fazzie-Maqianli b5dbb46172
[example] add diffusion inference (#1986)
2 years ago
binmakeswell a01278e810
Update requirements.txt
2 years ago
YuliangLiu0306 05020e50d0
[autoparallel] support more flexible data type (#1967)
2 years ago
Jiarui Fang 5bec3b2168
[Gemini] open grad checkpoint when model building (#1984)
2 years ago
Boyuan Yao c26f21d365
[autoparallel] add pooling metainfo (#1968)
2 years ago
Jiarui Fang 3712ac7f90
[Gemini] add bert for MemtracerWrapper unintests (#1982)
2 years ago
Jiarui Fang e481489aa6
[Gemini] MemtracerWrapper unittests (#1981)
2 years ago
mandoxzhang 52bd106627
add RoBERTa (#1980)
2 years ago
Jiarui Fang 31922110ad
[Gemini] memory trace hook (#1978)
2 years ago
Jiarui Fang 0529fcde06
[Gemini] independent runtime tracer (#1974)
2 years ago
YuliangLiu0306 0da1d00399
[autoparallel] support distributed dataloader option (#1906)
2 years ago
Genghan Zhang 6630d45546
[autoparallel] Add alpha beta (#1973)
2 years ago
Jiarui Fang cc0ed7cf33
[Gemini] ZeROHookV2 -> GeminiZeROHook (#1972)
2 years ago
ver217 f8a7148dec
[kernel] move all symlinks of kernel to `colossalai._C` (#1971)
2 years ago
Jiarui Fang 7e24b9b9ee
[Gemini] clean no used MemTraceOp (#1970)
2 years ago
Boyuan Yao 7c7921f71b
[autoparallel] add torch.nn.ReLU metainfo (#1868)
2 years ago
Jiarui Fang 8c66a1d0aa
[polish] remove useless file _mem_tracer_hook.py (#1963)
2 years ago
Jiarui Fang c4739a725a
[Gemini] polish memstats collector (#1962)
2 years ago
YuliangLiu0306 fea3cb661c
[autoparallel] support addmm in tracer and solver (#1961)
2 years ago
Jiarui Fang f7e276fa71
[Gemini] add GeminiAdamOptimizer (#1960)
2 years ago
HELSON 7066dfbf82
[zero] fix memory leak for zero2 (#1955)
2 years ago
Jiarui Fang 60abd86d6a
[example] enhance GPT demo (#1959)
2 years ago
Fazzie-Maqianli acba142929
Merge pull request #1958 from Fazziekey/lightning
2 years ago
Fazzie a09f88ab07 update model download in README
2 years ago
Fazzie-Maqianli 6bdd0a90ca
update lightning version (#1954)
2 years ago
Jiarui Fang 52c6ad26e0
[ColoTensor] reconfig ColoInitContext, decouple default_pg and default_dist_spec. (#1953)
2 years ago
zbian 598d456d0e fixed logger
2 years ago
zbian 6877121377 updated flash attention api
2 years ago
YuliangLiu0306 36c0f3ea5b
[autoparallel] remove redundancy comm node (#1893)
2 years ago
binmakeswell 9183e0dec5
[tutorial] polish all README (#1946)
2 years ago
Frank Lee de56b563b9
[tutorial] added missing dummy dataloader (#1944)
2 years ago
Frank Lee c6ea65011f
[tutorial] fixed pipeline bug for sequence parallel (#1943)
2 years ago
アマデウス e52f9d9109
[tensorparallel] fixed tp layers (#1938)
2 years ago
Jiarui Fang cf68cc92ac
[example] add vit (#1942)
2 years ago
YuliangLiu0306 c7925c5d08
[sc demo] add requirements to spmd README (#1941)
2 years ago
Boyuan Yao d5f5e06d82
[SC] remove redundant hands on (#1939)
2 years ago
Jiarui Fang 9f4fb3f28a
[ColoTensor] ColoInitContext initialize parameters in shard mode. (#1937)
2 years ago
ver217 b42b672842
[release] update version (#1931)
2 years ago
binmakeswell 41868f7605
[tutorial] polish README and OPT files (#1930)
2 years ago
ver217 b0b7a786b7
[tutorial] add synthetic dataset for opt (#1924)
2 years ago
Frank Lee 0486048453
[tutorial] updated hybrid parallel readme (#1928)
2 years ago
Frank Lee 807cbdb87d
[tutorial] added synthetic data for sequence parallel (#1927)
2 years ago
Frank Lee abf4c27f6a
[tutorial] removed huggingface model warning (#1925)
2 years ago
Frank Lee d43a671ad6
Hotfix/tutorial readme index (#1922)
2 years ago
Boyuan Yao 24cbee0ebe
[tutorial] modify hands-on of auto activation checkpoint (#1920)
2 years ago
Frank Lee ff16773ded
[tutorial] added synthetic data for hybrid parallel (#1921)
2 years ago