InternLM/internlm/model
Pryest b38ba5dad2 Fit to flash attention 1.0.5. 2023-10-09 21:03:16 +08:00
..
__init__.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
embedding.py fix(model/embedding.py): ci lint check error (#345) 2023-09-21 14:46:22 +08:00
linear.py feat(linear): optimize mlp by using jit (#321) 2023-09-19 14:57:43 +08:00
loss.py initial commit 2023-07-06 12:55:23 +08:00
metrics.py fix(metric): argument missing in getting loss metrics. (#256) 2023-08-31 17:44:39 +08:00
modeling_internlm.py feat(mdoel): add DynamicNTKScalingRotaryEmbedding (#339) 2023-09-20 23:31:47 +08:00
modeling_moe.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
moe.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
multi_head_attention.py Fit to flash attention 1.0.5. 2023-10-09 21:03:16 +08:00
norm.py Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
utils.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00