Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
ver217 304263c2ce
fix gpt attention mask (#461)
3 years ago
..
gpt fix gpt attention mask (#461) 3 years ago
moe added Multiply Jitter and capacity factor eval for MOE (#434) 3 years ago
vit
__init__.py
helper.py