InternLM/internlm/utils
gaoyang07 fdbdfcff34 remove micro_bsz 2023-11-25 22:44:20 +08:00
..
__init__.py initial commit 2023-07-06 12:55:23 +08:00
checkpoint.py initial commit 2023-07-06 12:55:23 +08:00
common.py fix(timeout): larger timeout (#495) 2023-11-21 19:19:22 +08:00
evaluation.py remove micro_bsz 2023-11-25 22:44:20 +08:00
gputest.py fix(utils): disable bench_net in gputest.py (#421) 2023-10-19 10:00:57 +08:00
logger.py feat(utils): add timeout warpper for key functions (#286) 2023-09-07 17:26:17 +08:00
megatron_timers.py feat: add runtime diag (#297) 2023-09-08 17:56:46 +08:00
model_checkpoint.py feat(ckpt): save ckpt when reach total step count (#486) 2023-11-09 21:07:16 +08:00
parallel.py feat(optimizer): zero gradient count (#449) 2023-10-27 16:26:55 +08:00
registry.py Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
simple_memory_profiler.py fix(moe): fix moe compatibility for fsdp and memory profiling (#417) 2023-10-17 14:13:48 +08:00
storage_manager.py fix(os): fix FileNotFoundError in storage_manager (#455) 2023-10-27 22:32:46 +08:00
timeout.py fix(timeout): larger timeout (#495) 2023-11-21 19:19:22 +08:00
writer.py fix(train): unify the exp paths (#492) 2023-11-11 20:15:59 +08:00