InternLM/internlm/train
JiaoPL 4eed07a3c3 compute vocab grad norm && save pt 2023-11-28 12:13:23 +08:00
__init__.py           feat(train): add fsdp training option (#293)  2023-10-09 18:59:31 +08:00
training_internlm.py  compute vocab grad norm && save pt            2023-11-28 12:13:23 +08:00
utils.py              fix(moe): remove norm&gate force sync (#448)  2023-11-01 11:29:55 +08:00