__init__.py | initial commit | 2023-07-06 12:55:23 +08:00
checkpoint.py | initial commit | 2023-07-06 12:55:23 +08:00
common.py | feat(train.py): support torch profiler (#201) | 2023-08-21 15:23:38 +08:00
evaluation.py | merge | 2023-08-24 16:38:36 +08:00
megatron_timers.py | Feat/overlap_bcast_forward (#218) | 2023-08-23 16:59:59 +08:00
model_checkpoint.py | add comments for moe | 2023-08-25 19:03:31 +08:00
parallel.py | fix moe bugs in zero optimizer | 2023-08-17 16:11:34 +08:00
timeout.py | Merge main to develop (#203) | 2023-08-16 15:57:26 +08:00
writer.py | fix(writer): fix tensorboard resume bug (#229) | 2023-08-24 17:38:39 +08:00