InternLM

History

Wenwen Qu fe0c342f9d get moe setting from gpc		2024-01-09 15:26:13 +08:00
..
apis	feat(tools): support origin internlm architecture in web_demo (#478 )	2023-11-09 20:01:55 +08:00
core	fix(pp): fix no-packed dataset load micro batch error (#538 )	2023-12-13 14:48:32 +08:00
data	fix the type_ids when micro_num=1 and use_flash_attn=False (#516 )	2023-12-06 14:38:28 +08:00
initialize	feat(logger): add tensorboard key value buffer (#549 )	2023-12-29 16:23:47 +08:00
model	get moe setting from gpc	2024-01-09 15:26:13 +08:00
moe	get moe setting from gpc	2024-01-09 15:26:13 +08:00
monitor	fix(alert): send exception of all ranks (#491 )	2023-11-10 19:04:31 +08:00
solver	fix when resuming lr_scheduler without loading optimizer (#565 )	2023-12-29 20:22:39 +08:00
train	fix the type_ids when micro_num=1 and use_flash_attn=False (#516 )	2023-12-06 14:38:28 +08:00
utils	refactor code	2024-01-08 16:03:55 +08:00
__init__.py	initial commit	2023-07-06 12:55:23 +08:00