InternLM/internlm/model
Wenwen Qu 0e6b1f856c add support for moe checkpoint 2023-08-24 17:01:14 +08:00
__init__.py feat(model/metrics.py): support calculating accuracy and perplexity m… (#91) 2023-07-26 16:22:10 +08:00
embedding.py Feat/example training internlm (#212) 2023-08-24 10:00:15 +08:00
linear.py fix(*): move sequence_parallel to parallel config (#224) 2023-08-24 09:49:04 +08:00
loss.py initial commit 2023-07-06 12:55:23 +08:00
metrics.py feat(*): support not-flash-attn for pp and no-pp (#145) 2023-07-28 16:13:04 +08:00
modeling_internlm.py merge 2023-08-24 16:38:36 +08:00
moe.py add support for moe checkpoint 2023-08-24 17:01:14 +08:00
multi_head_attention.py fix(*): move sequence_parallel to parallel config (#224) 2023-08-24 09:49:04 +08:00
norm.py Feat/optimizer (#194) 2023-08-15 18:55:10 +08:00
utils.py Merge main to develop (#203) 2023-08-16 15:57:26 +08:00