InternLM/internlm/core
Guoteng f6e007f95b
feat(ckpt): fix checkpoint bugs and add feature enhancements. (#259)
* fix(ckpt): ckpt bug fix and api refactor
1. fix latest ckpt query bug
2. add ckpt unit test
3. fix storage manager boto3/local client get_fns bug
4. fix only model load case zero fp32 buffer overwrite model weights bug.
5. add ckpt_type and add zero reload ci-test

* fix(ckpt): fix ckpt and trainer bug

* fix and refactor

* fix base on comment

* feat: add legacy api
2023-09-05 17:40:48 +08:00
..
communication Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
context Feat/add runntime gpu test (#254) 2023-09-01 13:38:01 +08:00
scheduler Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00
engine.py initial commit 2023-07-06 12:55:23 +08:00
gradient_handler.py initial commit 2023-07-06 12:55:23 +08:00
naive_amp.py Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
trainer.py feat(ckpt): fix checkpoint bugs and add feature enhancements. (#259) 2023-09-05 17:40:48 +08:00