InternLM/internlm
Wenwen Qu 95263fa1d0 merge operands in topk gating 2023-12-01 15:04:49 +08:00
..
apis feat(tools): support origin internlm architecture in web_demo (#478) 2023-11-09 20:01:55 +08:00
core feat(train): support_rampup_batch_size and fix bugs (#493) 2023-11-16 19:51:01 +08:00
data feat(data): walk folder to get dataset_type_ids_map (#477) 2023-11-07 19:21:10 +08:00
initialize feat(seed): set global seed for every model initialization (#496) 2023-11-17 14:42:50 +08:00
model feat(model): add rope_base interface (#512) 2023-11-23 16:30:14 +08:00
moe merge operands in topk gating 2023-12-01 15:04:49 +08:00
monitor fix(alert): send exception of all ranks (#491) 2023-11-10 19:04:31 +08:00
solver rename vars (#468) 2023-11-09 20:04:35 +08:00
train feat(train): update get_train_data_loader to make logic clearer (#498) 2023-11-14 17:05:15 +08:00
utils fix(timeout): larger timeout (#495) 2023-11-21 19:19:22 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00