InternLM/internlm
Wenwen Qu 8a595837fc merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
apis initial commit 2023-07-06 12:55:23 +08:00
core merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
data feat(data/utils.py): add new dataset type code for streaming dataset (#225) 2023-08-24 13:46:18 +08:00
initialize merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
model merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
moe replace flashatten experts by feedforward experts 2023-09-08 18:04:57 +08:00
monitor merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
solver merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
train merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
utils merge upstream/develop into feature_add_moe 2023-09-11 16:20:08 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00