InternLM/internlm/model
Wenwen Qu 929dd36cf2 avoid moe parameter partition in zero optimizer 2023-08-15 17:52:51 +08:00
..
__init__.py feat(model/metrics.py): support calculating accuracy and perplexity m… (#91) 2023-07-26 16:22:10 +08:00
embedding.py feat(monitor): support monitor and alert (#175) 2023-08-08 11:18:15 +08:00
linear.py feat(monitor): support monitor and alert (#175) 2023-08-08 11:18:15 +08:00
loss.py initial commit 2023-07-06 12:55:23 +08:00
metrics.py feat(*): support not-flash-attn for pp and no-pp (#145) 2023-07-28 16:13:04 +08:00
modeling_internlm.py modified: internlm/model/modeling_internlm.py 2023-08-11 11:58:22 +08:00
moe.py avoid moe parameter partition in zero optimizer 2023-08-15 17:52:51 +08:00
multi_head_attention.py feat(*): support sequence_parallel (#180) 2023-08-07 16:42:52 +08:00
norm.py feat(*): support no apex (#166) 2023-08-02 20:32:38 +08:00
utils.py feat(monitor): support monitor and alert (#175) 2023-08-08 11:18:15 +08:00