1440 Commits (978242326ac66f1be8869adf4d5d97cbc2618891)
 

Author SHA1 Message Date
Jiarui Fang 978242326a
[Gemini] remove eval in gemini unittests! (#2092) 2 years ago
YuliangLiu0306 7f72eb0510
[autoparallel] add embedding handler (#2089) 2 years ago
Jiarui Fang 1fca5d79ea
[Gemini] remove GLOBAL_MODEL_DATA_TRACER (#2091) 2 years ago
Jiarui Fang 28e55c2530
[Gemini] remove GLOBAL_CUDA_MEM_INFO (#2090) 2 years ago
Jiarui Fang 25abae6d7f
[Gemini] use MemStats in Runtime Memory tracer (#2088) 2 years ago
Jiarui Fang 33f4412102
[Gemini] use MemStats to store the tracing data. Separate it from Collector. (#2084) 2 years ago
Jiarui Fang 1f99205827
[Gemini] remove static tracer (#2083) 2 years ago
github-actions[bot] 28ef3f29af
Automated submodule synchronization (#1957) 2 years ago
YuliangLiu0306 0e9db368ef
[autoparallel] add tensor constructor handler (#2082) 2 years ago
YuliangLiu0306 cdf537a648
[autoparallel] add non_split linear strategy (#2078) 2 years ago
Boyuan Yao cf0268da93
[autoparallel] Add F.conv metainfo (#2069) 2 years ago
YuliangLiu0306 f123476666
[autoparallel] complete gpt block searching (#2065) 2 years ago
Ziyue Jiang 597cdd3006
[Pipeline Middleware] Adapt scheduler for Topo (#2066) 2 years ago
Jiarui Fang b3b89865e2
[Gemini] ParamOpHook -> ColoParamOpHook (#2080) 2 years ago
Jiarui Fang 4f21c9e8d9
[Gemini] polish runtime tracer tests (#2077) 2 years ago
YuliangLiu0306 677e1e20d4
[device] update flatten device mesh usage (#2079) 2 years ago
Jiarui Fang a7adad9ccb
[Gemini] rename hooks related to runtime mem tracer (#2076) 2 years ago
Jiarui Fang 40b7d55bf3
[Gemini] add albert in test models. (#2075) 2 years ago
Jiarui Fang 616ed91ecd
[test] bert test in non-distributed way (#2074) 2 years ago
Jiarui Fang 223332ff7e
[Gemini] rename ParamTracerWrapper -> RuntimeMemTracer (#2073) 2 years ago
Jiarui Fang 9f828ef36f
[Gemini] remove not used MemtracerWrapper (#2072) 2 years ago
Boyuan Yao 616da17fab
[autoparallel] add binary elementwise metainfo for auto parallel (#2058) 2 years ago
Boyuan Yao 4b40fbd743
[autoparallel] fix forward memory calculation (#2062) 2 years ago
Ziyue Jiang 44ea461890
[Pipeline] Add Topo Class (#2059) 2 years ago
YuliangLiu0306 e4293e5077
[hotfix] update test for latest version (#2060) 2 years ago
YuliangLiu0306 19438ea0ef
[hotfix] skip gpt tracing test (#2064) 2 years ago
Zihao 38ea4ba1bd
[Gemini] fix grad unreleased issue and param recovery issue (#2052) 2 years ago
YuliangLiu0306 edf4cd46c5
[examples] update autoparallel demo (#2061) 2 years ago
YuliangLiu0306 1c1fe44305
[autoparallel] adapt solver with self attention (#2037) 2 years ago
Frank Lee d3499c98d4
[release] update to 0.1.11rc5 (#2053) 2 years ago
Frank Lee ea74a3b9cc
[cli] updated installation check with more information (#2050) 2 years ago
HELSON f6178728a0
[gemini] fix init bugs for modules (#2047) 2 years ago
Frank Lee 81e0da7fa8
[setup] supported conda-installed torch (#2048) 2 years ago
HELSON e37f3db40c
[gemini] add arguments (#2046) 2 years ago
Zihao 6a9158f1fa
[Gemini] free and allocate cuda memory by tensor.storage, add grad hook (#2040) 2 years ago
Jiarui Fang 1e885329f4
[test] align model name with the file name. (#2045) 2 years ago
Jiarui Fang 31c644027b
[hotfix] hotfix Gemini for no leaf modules bug (#2043) 2 years ago
HELSON 384cd26314
[zero] fix testing parameters (#2042) 2 years ago
HELSON 17a3c685b0
[zero] fix unit-tests (#2039) 2 years ago
Jiarui Fang eb7742a4bb
[Gemini] more tests for Gemini (#2038) 2 years ago
HELSON 537e181705
[testing] fix testing models (#2036) 2 years ago
HELSON a1ce02d740
[zero] test gradient accumulation (#1964) 2 years ago
Ziyue Jiang b0936e4a44
[rpc] split with dag (#2028) 2 years ago
Jiarui Fang 96134e7be3
[hotfix] add bert test for gemini fwd bwd (#2035) 2 years ago
YuliangLiu0306 0dbcd4a6f5
[autoparallel] add split handler (#2032) 2 years ago
Jiarui Fang 28aa9a4294
[Gemini] more rigorous unit tests for run_fwd_bwd (#2034) 2 years ago
YuliangLiu0306 81330b0352
[autoparallel] add experimental permute handler (#2029) 2 years ago
Zihao 95c4532fff
[Gemini] paramWrapper paramTracerHook unit test (#2030) 2 years ago
Jiarui Fang 8daf1b4db1
[Gemini] patch for supporting torch.add_ function for ColoTensor (#2003) 2 years ago
Ziyue Jiang 632753abbc
[fx] Split partition with DAG information (#2025) 2 years ago