.. |
amp
|
polish amp docstring (#616)
|
2022-04-01 16:09:39 +08:00 |
builder
|
html refactor (#555)
|
2022-03-31 11:36:56 +08:00 |
communication
|
[model checkpoint] updated communication ops for cpu tensors (#590)
|
2022-04-01 16:52:20 +08:00 |
context
|
[model checkpoint] add gloo groups for cpu tensor communication (#589)
|
2022-04-01 10:15:52 +08:00 |
engine
|
[refactor] memory utils (#577)
|
2022-04-01 09:22:33 +08:00 |
kernel
|
[cuda] modify the fused adam, support hybrid of fp16 and fp32 (#497)
|
2022-03-25 14:15:53 +08:00 |
logging
|
Refactored docstring to google style
|
2022-03-29 17:17:47 +08:00 |
nn
|
[model checkpoint] updated saving/loading for 1d layers (#594)
|
2022-04-01 16:51:52 +08:00 |
registry
|
Refactored docstring to google style
|
2022-03-29 17:17:47 +08:00 |
testing
|
[test] fixed rerun_on_exception and adapted test cases (#487)
|
2022-03-25 17:25:12 +08:00 |
trainer
|
html refactor (#555)
|
2022-03-31 11:36:56 +08:00 |
utils
|
[model checkpoint] updated checkpoint save/load utils (#592)
|
2022-04-01 16:49:21 +08:00 |
zero
|
polish docstring of zero (#612)
|
2022-04-01 14:50:56 +08:00 |
__init__.py
|
Develop/experiments (#59)
|
2021-12-09 15:08:29 +08:00 |
constants.py
|
fix format constants.py (#358)
|
2022-03-11 15:50:28 +08:00 |
core.py
|
[polish] polish singleton and global context (#500)
|
2022-03-23 18:03:39 +08:00 |
global_variables.py
|
[MOE] add unitest for MOE experts layout, gradient handler and kernel (#469)
|
2022-03-21 13:35:04 +08:00 |
initialize.py
|
Refactored docstring to google style
|
2022-03-29 17:17:47 +08:00 |