Commit Graph

5 Commits (7cec7e985f392d9be6a839c455f3c39e25a24a90)

Author SHA1 Message Date
Guoteng 757e19e01a
1. fix(config): rampup_batch_size defalut value BC. (#515)
2. fix(config): standardize config parameter access.
3. feat(launch): add warmup_process_group
4. feat(memory): add cuda_memory_analyze
2023-11-28 19:33:46 +08:00
Guoteng 3ea94f2e2a
fix(utils): disable bench_net in gputest.py (#421) 2023-10-19 10:00:57 +08:00
jiaxingli f5337f6e02
Feat(PythonGC): Do garbage collection manually (#326)
* feat:add gc control

* feat:add gc control

* feat:add gc control

* feat:add gc

* re-lint
2023-09-22 13:52:25 +08:00
Sun Peng 1ee31ff9b1
feat: add runtime diag (#297)
* feat: add runtime diag

* add diag_outlier_ratio

---------

Co-authored-by: yingtongxiong <974106207@qq.com>
2023-09-08 17:56:46 +08:00
Sun Peng 860de0aa46
Feat/add runntime gpu test (#254)
* feat: add gpu bench

* feat/add allreduce runtime bench

---------

Co-authored-by: sunpengsdu <sunpengsdu@gmail.com>
2023-09-01 13:38:01 +08:00