2180 Commits (31c78f2be3272a9a4062fe78eca34b3847a0c900)
 

Author SHA1 Message Date
Frank Lee ea0b52c12e
[doc] specified operating system requirement (#3019)
2 years ago
ver217 378d827c6b
[doc] update nvme offload doc (#3014)
2 years ago
Fazzie-Maqianli c21b11edce
change nn to models (#3032)
2 years ago
YuliangLiu0306 4269196c79
[hotfix] skip auto checkpointing tests (#3029)
2 years ago
Frank Lee 8fedc8766a
[workflow] supported conda package installation in doc test (#3028)
2 years ago
Frank Lee 2cd6ba3098
[workflow] fixed the post-commit failure when no formatting needed (#3020)
2 years ago
Frank Lee 2e427ddf42
[revert] recover "[refactor] restructure configuration files (#2977)" (#3022)
2 years ago
github-actions[bot] e86d9bb2e1
[format] applied code formatting on changed files in pull request 3025 (#3026)
2 years ago
YuliangLiu0306 cd2b0eaa8d
[DTensor] refactor sharding spec (#2987)
2 years ago
Ziyue Jiang 400f63012e
[pipeline] Add Simplified Alpa DP Partition (#2507)
2 years ago
Super Daniel b42d3d28ed
[fx] remove depreciated algorithms. (#2312) (#2313)
2 years ago
BlueRum 55dcd3051a
[chatgpt] fix readme (#3025)
2 years ago
LuGY 287d60499e
[chatgpt] Add saving ckpt callback for PPO (#2880)
2 years ago
BlueRum e588703454
[chatgpt]fix inference model load (#2988)
2 years ago
github-actions[bot] 82503a96f2
[format] applied code formatting on changed files in pull request 2997 (#3008)
2 years ago
binmakeswell 52a5078988
[doc] add ISC tutorial (#2997)
2 years ago
Saurav Maheshkar 35c8f4ce47
[refactor] restructure configuration files (#2977)
2 years ago
ver217 823f3b9cf4
[doc] add deepspeed citation and copyright (#2996)
2 years ago
Frank Lee e0a1c1321c
[doc] added reference to related works (#2994)
2 years ago
Yasyf Mohamedali 19fa0e57f6
Remove extraneous comma (#2993)
2 years ago
Frank Lee 3a5d93bc2c
[kernel] cached the op kernel and fixed version check (#2886)
2 years ago
ver217 0ff8406b00
[chatgpt] allow shard init and display warning (#2986)
2 years ago
BlueRum f5ca0397dd
[chatgpt] fix lora gemini conflict in RM training (#2984)
2 years ago
ver217 19ad49fb3b
[chatgpt] making experience support dp (#2971)
2 years ago
github-actions[bot] 827a0af8cc
Automated submodule synchronization (#2982)
2 years ago
binmakeswell 9b4ceefc21
[doc] update news (#2983)
2 years ago
BlueRum c9e27f0d1b
[chatgpt]fix lora bug (#2974)
2 years ago
BlueRum 82149e9d1b
[chatgpt] fix inference demo loading bug (#2969)
2 years ago
Fazzie-Maqianli bbf9c827c3
[ChatGPT] fix README (#2966)
2 years ago
binmakeswell b0a8766381
[doc] fix chatgpt inference typo (#2964)
2 years ago
github-actions[bot] 0d07514988
Automated submodule synchronization (#2951)
2 years ago
YuliangLiu0306 e414e4092b
[DTensor] implementation of dtensor (#2946)
2 years ago
BlueRum 489a9566af
[chatgpt]add inference example (#2944)
2 years ago
YuliangLiu0306 47fb214b3b
[hotfix] add shard dim to aviod backward communication error (#2954)
2 years ago
ver217 090f14fd6b
[misc] add reference (#2930)
2 years ago
github-actions[bot] dca98937f8
[format] applied code formatting on changed files in pull request 2933 (#2939)
2 years ago
binmakeswell 8264cd7ef1
[doc] add env scope (#2933)
2 years ago
Frank Lee b8804aa60c
[doc] added readme for documentation (#2935)
2 years ago
Frank Lee 9e3b8b7aff
[doc] removed read-the-docs (#2932)
2 years ago
Frank Lee 77b88a3849
[workflow] added auto doc test on PR (#2929)
2 years ago
YuliangLiu0306 197d0bf4ed
[autoparallel] apply repeat block to reduce solving time (#2912)
2 years ago
YH a848091141
Fix port exception type (#2925)
2 years ago
zbian 61e687831d
fixed using zero with tp cannot access weight correctly
2 years ago
github-actions[bot] eb5cf94332
Automated submodule synchronization (#2927)
2 years ago
github-actions[bot] da056285f2
[format] applied code formatting on changed files in pull request 2922 (#2923)
2 years ago
binmakeswell 12bafe057f
[doc] update installation for GPT (#2922)
2 years ago
binmakeswell 0afb55fc5b
[doc] add os scope, update tutorial install and tips (#2914)
2 years ago
YH 7b13f7db18
[zero] trivial zero optimizer refactoring (#2869)
2 years ago
fastalgo dbc01b9c04
Update README.md
2 years ago
Frank Lee e33c043dec
[workflow] moved pre-commit to post-commit (#2895)
2 years ago