.. |
commons
|
[zero] polish sharded param name (#484)
|
2022-03-22 14:36:16 +08:00 |
data_sampler
|
fixed utils docstring and add example to readme (#200)
|
2022-02-03 11:37:17 +08:00 |
gradient_accumulation
|
Fixed docstring in colossalai (#171)
|
2022-01-21 10:44:30 +08:00 |
memory_tracer
|
fixed mem monitor device (#433)
|
2022-03-16 15:25:02 +08:00 |
multi_tensor_apply
|
Fixed docstring in colossalai (#171)
|
2022-01-21 10:44:30 +08:00 |
profiler
|
fixed error when no collective communication in CommProfiler
|
2022-03-14 17:21:00 +08:00 |
tensor_detector
|
Added tensor detector (#393)
|
2022-03-14 18:01:46 +08:00 |
__init__.py
|
[MOE] remove old MoE legacy (#493)
|
2022-03-22 17:37:16 +08:00 |
activation_checkpoint.py
|
Added activation offload (#331)
|
2022-03-11 15:50:28 +08:00 |
checkpointing.py
|
fixed mkdir conflict and align yapf config with flake (#220)
|
2022-02-15 11:31:13 +08:00 |
common.py
|
[MOE] remove old MoE legacy (#493)
|
2022-03-22 17:37:16 +08:00 |
cuda.py
|
Fixed docstring in colossalai (#171)
|
2022-01-21 10:44:30 +08:00 |
memory.py
|
Fixed docstring in colossalai (#171)
|
2022-01-21 10:44:30 +08:00 |
moe.py
|
[format] polish name format for MOE (#481)
|
2022-03-21 23:19:47 +08:00 |
timer.py
|
[profiler] primary memory tracer
|
2022-03-11 15:50:28 +08:00 |