InternLM/internlm/model
Wenwen Qu c76182b2d6
Merge branch 'develop' into feature_add_moe
2023-08-17 16:37:06 +08:00
File | Last commit | Date
__init__.py | feat(model/metrics.py): support calculating accuracy and perplexity m… (#91) | 2023-07-26 16:22:10 +08:00
embedding.py | feat(monitor): support monitor and alert (#175) | 2023-08-08 11:18:15 +08:00
linear.py | feat(monitor): support monitor and alert (#175) | 2023-08-08 11:18:15 +08:00
loss.py | initial commit | 2023-07-06 12:55:23 +08:00
metrics.py | feat(*): support not-flash-attn for pp and no-pp (#145) | 2023-07-28 16:13:04 +08:00
modeling_internlm.py | Merge branch 'develop' into feature_add_moe | 2023-08-17 16:37:06 +08:00
moe.py | avoid moe parameter partition in zero optimizer | 2023-08-15 17:52:51 +08:00
multi_head_attention.py | feat(*): support sequence_parallel (#180) | 2023-08-07 16:42:52 +08:00
norm.py | Feat/optimizer (#194) | 2023-08-15 18:55:10 +08:00
utils.py | Merge main to develop (#203) | 2023-08-16 15:57:26 +08:00