535 Commits (cf68cc92accd5f0a2538b24e03f1f4f857b69fb9)

Author SHA1 Message Date
Jiarui Fang 9f4fb3f28a
[ColoTensor] ColoInitContext initialize parameters in shard mode. (#1937) 2 years ago
HELSON 6e51d296f0
[zero] migrate zero1&2 (#1878) 2 years ago
Jiarui Fang 51597f6a28
[hotfix] pass test_complete_workflow (#1877) 2 years ago
Jiarui Fang 986f8cbaa7
[inference] overlap comm and compute in Linear1D_Row when stream_chunk_num > 1 (#1876) 2 years ago
YuliangLiu0306 1b494ad73c
[autoparallel] fix linear logical convert issue (#1857) 2 years ago
Jiarui Fang c2947dadf1
[inference] streaming Linear 1D Row inference (#1874) 2 years ago
xcnick a141681260
[amp] add torch amp test (#1860) 2 years ago
Frank Lee e6ec99d389
[utils] fixed lazy init context (#1867) 2 years ago
Jiarui Fang 3ce4463fe6
[utils] remove lazy_memory_allocate from ColoInitContext (#1844) 2 years ago
YuliangLiu0306 f6032ddb17
[autoparallel] fix bias addition module (#1800) 2 years ago
ver217 99870726b1
[CheckpointIO] a uniform checkpoint I/O module (#1689) 2 years ago
Boyuan Yao 629172b319
[autoparallel] add batch norm metainfo (#1815) 2 years ago
Super Daniel 441d584e4a
[fx] add a symbolic_trace api. (#1812) 2 years ago
Jiarui Fang 6fa71d65d3
[fx] skip diffusers unitest if it is not installed (#1799) 2 years ago
oahzxl 9639ea88fc
[kernel] more flexible flashatt interface (#1804) 2 years ago
Boyuan Yao 327d07c44a
[autoparallel] add conv metainfo class for auto parallel (#1796) 2 years ago
oahzxl 501a9e9cd2
[hotfix] polish flash attention (#1802) 2 years ago
Jiarui Fang c248800359
[kernel] skip tests of flash_attn and triton when they are not available (#1798) 2 years ago
YuliangLiu0306 e34e850a4c
[autoparallel]add essential CommActions for broadcast oprands (#1793) 2 years ago
Boyuan Yao 05ce3d369f
[fx] Add linear metainfo class for auto parallel (#1783) 2 years ago
YuliangLiu0306 2c4c7b3618
[autoparallel] add getattr handler (#1767) 2 years ago
HELSON c6a1a62636
[hotfix] fix zero's incompatibility with checkpoint in torch-1.12 (#1786) 2 years ago
Jiarui Fang 32c1b843a9
skip torchrec unittests if not installed (#1790) 2 years ago
kurisusnowdeng 0b8161fab8 updated tp layers 2 years ago
YuliangLiu0306 e859380bf7
[fx] support module with bias addition (#1780) 2 years ago
Frank Lee f3f19a5c47
[autoparallel] added matmul handler (#1763) 2 years ago
YuliangLiu0306 27de252334
[autoparallel] fix conv handler numerical test (#1771) 2 years ago
Super Daniel 1e88811c7a
[autoparallel] move ckpt solvers to autoparallel folder / refactor code (#1764) 2 years ago
YuliangLiu0306 a4d1f59c78
[autoparallel] add numerical test for handlers (#1769) 2 years ago
YuliangLiu0306 b0f7c8bde8
[autoparallel] update CommSpec to CommActions (#1768) 2 years ago
YuliangLiu0306 b4cc59b61e
[autoparallel] add numerical test for node strategies (#1760) 2 years ago
oahzxl 25952b67d7
[feat] add flash attention (#1762) 2 years ago
Super Daniel 0584654c79
[fx] refactor memory utils and extend shard utils. (#1754) 2 years ago
YuliangLiu0306 314d8c497f
[autoparallel] refactor the runtime apply pass and add docstring to passes (#1757) 2 years ago
Frank Lee f9a613d660
[autoparallel] added binary elementwise node handler (#1758) 2 years ago
YuliangLiu0306 d2fc067231
[autoparallel] fix param hook issue in transform pass (#1755) 2 years ago
Frank Lee 262652c8bc
[autoparallel] added addbmm handler (#1751) 2 years ago
YuliangLiu0306 980ed21723
[autoparallel] shard param and buffer as expected (#1753) 2 years ago
YuliangLiu0306 cdb7d5e7d2
[hotfix] autoparallel unit test (#1752) 2 years ago
YuliangLiu0306 a4ce180e85
[autoparallel] add sequential order to communication actions (#1735) 2 years ago
Super Daniel b893342f95
[fx] test tracer on diffuser modules. (#1750) 2 years ago
Frank Lee b80b6eaa88
[autoparallel] recovered skipped test cases (#1748) 2 years ago
Frank Lee 474111ecb5
[autoparallel] fixed wrong sharding strategy in conv handler (#1747) 2 years ago
Frank Lee 8b8937d901
[autoparallel] fixed wrong generated strategy for dot op (#1746) 2 years ago
Frank Lee 88a79814fb
[autoparallel] handled illegal strategy in node handler (#1743) 2 years ago
Super Daniel 30874f1692
[fx/profiler] debug the fx.profiler / add an example test script for fx.profiler (#1730) 2 years ago
Frank Lee eee84908d4
[autoparallel] handled illegal sharding strategy (#1728) 2 years ago
Ziheng Qin cbe9a4cb45 [NFC] polish tests/test_layers/test_3d/test_3d.py code style (#1740) 2 years ago
lucasliunju 912eb58ea0 [NFC] polish tests/test_layers/test_3d/checks_3d/common.py code style (#1733) 2 years ago
Xue Fuzhao 754aa7c81f [NFC] polish tests/test_layers/test_3d/checks_3d/check_layer_3d.py code style (#1731) 2 years ago