Hongxin Liu
|
079bf3cb26
|
[misc] update pre-commit and run all files (#4752)
* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format
|
2023-09-19 14:20:26 +08:00 |
FoolPlayer
|
c3ca53cf05
|
[test] skip some not compatible models
|
2023-08-15 23:25:14 +08:00 |
Hongxin Liu
|
411cf1d2db
|
[hotfix] fix gemini and zero test (#4333)
* [hotfix] fix gemini and zero test
* [hotfix] fix lazy init test
* [hotfix] fix lazy init test
|
2023-08-15 23:25:14 +08:00 |
Hongxin Liu
|
16bf4c0221
|
[test] remove useless tests (#4359)
* [test] remove legacy zero test
* [test] remove lazy distribute test
* [test] remove outdated checkpoint io
|
2023-08-01 18:52:14 +08:00 |
Hongxin Liu
|
fc5cef2c79
|
[lazy] support init on cuda (#4269)
* [lazy] support init on cuda
* [test] update lazy init test
* [test] fix transformer version
|
2023-07-19 16:43:01 +08:00 |
Frank Lee
|
c4b1b65931
|
[test] fixed tests failed due to dtensor change (#4082)
* [test] fixed tests failed due to dtensor change
* polish code
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
8eb09a4c69
|
[shardformer] support module saving and loading (#4062)
* [shardformer] support module saving and loading
* polish code
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
58df720570
|
[shardformer] adapted T5 and LLaMa test to use kit (#4049)
* [shardformer] adapted T5 and LLaMa test to use kit
* polish code
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
ddcf58cacf
|
Revert "[sync] sync feature/shardformer with develop"
|
2023-06-09 09:41:27 +08:00 |
Frank Lee
|
eb39154d40
|
[dtensor] updated api and doc (#3845)
|
2023-06-08 10:18:17 +08:00 |
Hongxin Liu
|
dbb32692d2
|
[lazy] refactor lazy init (#3891)
* [lazy] remove old lazy init
* [lazy] refactor lazy init folder structure
* [lazy] fix lazy tensor deepcopy
* [test] update lazy init test
|
2023-06-05 14:20:47 +08:00 |