Making large AI models cheaper, faster and more accessible
HELSON dbdc9a7783
added Multiply Jitter and capacity factor eval for MOE (#434)
3 years ago
amp fixed fp16 optimizer none grad bug (#432) 3 years ago
builder
communication
context
engine
kernel
logging
nn added Multiply Jitter and capacity factor eval for MOE (#434) 3 years ago
registry
trainer
utils fixed mem monitor device (#433) 3 years ago
zero sync before creating empty grad 3 years ago
__init__.py
constants.py
core.py
global_variables.py
initialize.py