690 Commits (4af31d263dd12c9238607fa48e5fd0488cd8cf25)

Author SHA1 Message Date
YuliangLiu0306 f123476666
[autoparallel] complete gpt block searching (#2065) 2 years ago
Ziyue Jiang 597cdd3006
[Pipeline Middleware] Adapt scheduler for Topo (#2066) 2 years ago
Jiarui Fang 4f21c9e8d9
[Gemini] polish runtime tracer tests (#2077) 2 years ago
Jiarui Fang a7adad9ccb
[Gemini] rename hooks related to runtime mem tracer (#2076) 2 years ago
Jiarui Fang 40b7d55bf3
[Gemini] add albert in test models. (#2075) 2 years ago
Jiarui Fang 616ed91ecd
[test] bert test in non-distributed way (#2074) 2 years ago
Jiarui Fang 223332ff7e
[Gemini] rename ParamTracerWrapper -> RuntimeMemTracer (#2073) 2 years ago
Jiarui Fang 9f828ef36f
[Gemini] remove not used MemtracerWrapper (#2072) 2 years ago
Boyuan Yao 616da17fab
[autoparallel] add binary elementwise metainfo for auto parallel (#2058) 2 years ago
Ziyue Jiang 44ea461890
[Pipeline] Add Topo Class (#2059) 2 years ago
YuliangLiu0306 e4293e5077
[hotfix] update test for latest version (#2060) 2 years ago
YuliangLiu0306 19438ea0ef
[hotfix] skip gpt tracing test (#2064) 2 years ago
Zihao 38ea4ba1bd
[Gemini] fix grad unreleased issue and param recovery issue (#2052) 2 years ago
YuliangLiu0306 1c1fe44305
[autoparallel] adapt solver with self attention (#2037) 2 years ago
HELSON f6178728a0
[gemini] fix init bugs for modules (#2047) 2 years ago
Zihao 6a9158f1fa
[Gemini] free and allocate cuda memory by tensor.storage, add grad hook (#2040) 2 years ago
Jiarui Fang 1e885329f4
[test] align model name with the file name. (#2045) 2 years ago
Jiarui Fang 31c644027b
[hotfix] hotfix Gemini for no leaf modules bug (#2043) 2 years ago
HELSON 384cd26314
[zero] fix testing parameters (#2042) 2 years ago
HELSON 17a3c685b0
[zero] fix unit-tests (#2039) 2 years ago
Jiarui Fang eb7742a4bb
[Gemini] more tests for Gemini (#2038) 2 years ago
HELSON 537e181705
[testing] fix testing models (#2036) 2 years ago
HELSON a1ce02d740
[zero] test gradient accumulation (#1964) 2 years ago
Ziyue Jiang b0936e4a44
[rpc] split with dag (#2028) 2 years ago
Jiarui Fang 96134e7be3
[hotfix] add bert test for gemini fwd bwd (#2035) 2 years ago
YuliangLiu0306 0dbcd4a6f5
[autoparallel] add split handler (#2032) 2 years ago
Jiarui Fang 28aa9a4294
[Gemini] more rigorous unit tests for run_fwd_bwd (#2034) 2 years ago
YuliangLiu0306 81330b0352
[autoparallel] add experimental permute handler (#2029) 2 years ago
Zihao 95c4532fff
[Gemini] paramWrapper paramTracerHook unitest (#2030) 2 years ago
Jiarui Fang 8daf1b4db1
[Gemini] patch for supporting orch.add_ function for ColoTensor (#2003) 2 years ago
Ziyue Jiang 632753abbc
[fx]Split partition with DAG information (#2025) 2 years ago
YuliangLiu0306 ea0f6b8df9
[autoparallel] add runtime pass and numerical test for view handler (#2018) 2 years ago
Jiarui Fang 2e9cbfca12
[Gemini] add unitests to check gemini correctness (#2015) 2 years ago
Jiarui Fang 0b0d8f9e17
[hotfix] revert bug PRs (#2016) 2 years ago
Zihao 0160a62a3c
[Gemini] param_tracer_wrapper and test case (#2009) 2 years ago
YuliangLiu0306 1438993113
[autoparallel] add experimental view handler (#2011) 2 years ago
Genghan Zhang d655eea515
[autoparallel] mix gather (#1977) 2 years ago
Jiarui Fang 3d907faede
[Gemini] add an inline_op_module to common test models and polish unitests. (#2004) 2 years ago
Boyuan Yao 6cd784ffee
[autoparallel] Add metainfo support for F.linear (#1987) 2 years ago
YuliangLiu0306 35e6b9ec82
[autoparallel] adapt handlers with attention block (#1990) 2 years ago
Jiarui Fang 5bec3b2168
[Gemini] open grad checkpoint when model building (#1984) 2 years ago
Boyuan Yao c26f21d365
[autoparallel] add pooling metainfo (#1968) 2 years ago
Jiarui Fang 3712ac7f90
[Gemini] add bert for MemtracerWrapper unintests (#1982) 2 years ago
Jiarui Fang e481489aa6
[Gemini] MemtracerWrapper unittests (#1981) 2 years ago
YuliangLiu0306 0da1d00399
[autoparallel] support distributed dataloader option (#1906) 2 years ago
Genghan Zhang 6630d45546
[autoparallel] Add alpha beta (#1973) 2 years ago
ver217 f8a7148dec
[kernel] move all symlinks of kernel to `colossalai._C` (#1971) 2 years ago
Boyuan Yao 7c7921f71b
[autoparallel] add torch.nn.ReLU metainfo (#1868) 2 years ago
YuliangLiu0306 fea3cb661c
[autoparallel] support addmm in tracer and solver (#1961) 2 years ago
Jiarui Fang f7e276fa71
[Gemini] add GeminiAdamOptimizer (#1960) 2 years ago