InternLM/internlm
Wenwen Qu ebf2942746 import internlm.moe 2024-01-12 14:01:23 +08:00
apis feat(tools): support origin internlm architecture in web_demo (#478) 2023-11-09 20:01:55 +08:00
core fix(pp): fix no-packed dataset load micro batch error (#538) 2023-12-13 14:48:32 +08:00
data fix the type_ids when micro_num=1 and use_flash_attn=False (#516) 2023-12-06 14:38:28 +08:00
initialize use regester to get moe impl 2024-01-12 13:20:17 +08:00
model import internlm.moe 2024-01-12 14:01:23 +08:00
moe import internlm.moe 2024-01-12 14:01:23 +08:00
monitor fix(alert): send exception of all ranks (#491) 2023-11-10 19:04:31 +08:00
solver fix when resuming lr_scheduler without loading optimizer (#565) 2023-12-29 20:22:39 +08:00
train fix the type_ids when micro_num=1 and use_flash_attn=False (#516) 2023-12-06 14:38:28 +08:00
utils refactor code 2024-01-08 16:03:55 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00