86 Commits (191daf74111251df7327f4ef7a069d0254554d2c)

Author SHA1 Message Date
YuliangLiu0306 4269196c79
[hotfix] skip auto checkpointing tests (#3029) 2 years ago
Ziyue Jiang 44ea461890
[Pipeline] Add Topo Class (#2059) 2 years ago
YuliangLiu0306 19438ea0ef
[hotfix] skip gpt tracing test (#2064) 2 years ago
Ziyue Jiang 632753abbc
[fx]Split partition with DAG information (#2025) 2 years ago
Jiarui Fang 51597f6a28
[hotfix] pass test_complete_workflow (#1877) 2 years ago
Jiarui Fang c2947dadf1
[inference] streaming Linear 1D Row inference (#1874) 2 years ago
Frank Lee e6ec99d389
[utils] fixed lazy init context (#1867) 2 years ago
Super Daniel 441d584e4a
[fx] add a symbolic_trace api. (#1812) 2 years ago
Jiarui Fang 6fa71d65d3
[fx] skip diffusers unitest if it is not installed (#1799) 2 years ago
YuliangLiu0306 e34e850a4c
[autoparallel]add essential CommActions for broadcast oprands (#1793) 2 years ago
Jiarui Fang 32c1b843a9
skip torchrec unittests if not installed (#1790) 2 years ago
YuliangLiu0306 e859380bf7
[fx] support module with bias addition (#1780) 2 years ago
Super Daniel 1e88811c7a
[autoparallel] move ckpt solvers to autoparallel folder / refactor code (#1764) 2 years ago
Super Daniel 0584654c79
[fx] refactor memory utils and extend shard utils. (#1754) 2 years ago
Super Daniel b893342f95
[fx] test tracer on diffuser modules. (#1750) 2 years ago
Super Daniel 30874f1692
[fx/profiler] debug the fx.profiler / add an example test script for fx.profiler (#1730) 2 years ago
Super Daniel 393f594051
[fx/meta/rpc] move _meta_registration.py to fx folder / register fx functions with compatibility checks / remove color debug (#1710) 2 years ago
Boyuan Yao 31d2f03d27
[autoparallel] fix C version rotor inconsistency (#1691) 2 years ago
Super Daniel 3dd6994427
[fx/profiler] assigned UUID to each unrecorded tensor/ improved performance on GPT-2 (#1679) 2 years ago
Boyuan Yao 1df98d5b66
[autoparallel] add rotor C version (#1658) 2 years ago
Frank Lee 30e50c8b4a
[autoparallel] implemented all matmul strategy generator (#1650) 2 years ago
Boyuan Yao 5d0fdb9cb4
[fx] fix offload codegen test (#1648) 2 years ago
Boyuan Yao d6b01feb66
[fx] Modify offload codegen (#1618) 2 years ago
Super Daniel d967779a32
[fx/profiler] tuned the calculation of memory estimation (#1619) 2 years ago
Boyuan Yao 933b6c6367
[fx] Add pofo solver (#1608) 2 years ago
Boyuan Yao a7cda6f57d
[fx] Add offload codegen (#1598) 2 years ago
Super Daniel c8e9b2ad78
[hotfix/rotor] fix variable names (#1597) 2 years ago
Boyuan Yao f3687e4ee2
[fx] Add nested checkpoint in activation checkpoint codegen (#1585) 2 years ago
Super Daniel 4f59693207
[fx] provide a stable but not accurate enough version of profiler. (#1547) 2 years ago
Boyuan Yao 56159049e8
[fx] Modify solver linearize and add corresponding test (#1531) 2 years ago
Super Daniel 7dc53237c3
[fx] add test for meta tensor. (#1527) 2 years ago
YuliangLiu0306 4b3d6caeb3
[fx]patch nn.functional convolution (#1528) 2 years ago
Boyuan Yao b231430bcb
[fx] Fix wrong index in annotation and minimal flops in ckpt solver (#1521) 2 years ago
Boyuan Yao 4acc58ee20
[fx] Fix activation codegen dealing with checkpointing first op (#1510) 2 years ago
Boyuan Yao de1e716dc4
[fx] Add activation checkpoint solver rotor (#1496) 2 years ago
Frank Lee 3da68d6b1b
[fx] fixed adapative pooling size concatenation error (#1489) 2 years ago
Super Daniel 32efe8e740
[fx] add profiler for fx nodes. (#1480) 2 years ago
Boyuan Yao 1f2e547f7a
[fx] Fix ckpt functions' definitions in forward (#1476) 2 years ago
Super Daniel e7383f578b
[fx] add rules to linearize computation graphs for searching. (#1461) 2 years ago
Boyuan Yao 092b9c8f49
[fx] Add use_reentrant=False to checkpoint in codegen (#1463) 2 years ago
Super Daniel 0dbd61c29b
[fx] fix test and algorithm bugs in activation checkpointing. (#1451) 2 years ago
Frank Lee 5a52e21fe3
[test] fixed the activation codegen test (#1447) 2 years ago
Boyuan Yao 5774fe0270
[fx] Use colossalai checkpoint and add offload recognition in codegen (#1439) 2 years ago
Super Daniel d40a9392ba
[fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174. (#1446) 2 years ago
Super Daniel 3b26516c69
[fx] add vanilla activation checkpoint search with test on resnet and densenet (#1433) 2 years ago
Super Daniel f20cb4e893
[fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages (#1425) 2 years ago
Frank Lee adf5054ff8
[fx] fixed torchaudio conformer tracing (#1392) 2 years ago
Frank Lee 7d6293927f
[fx] patched torch.max and data movement operator (#1391) 2 years ago
Super Daniel be229217ce
[fx] add torchaudio test (#1369) 2 years ago
Boyuan Yao bb640ec728
[fx] Add colotracer compatibility test on torchrec (#1370) 2 years ago