--- license: mit --- - Working in progress... - # of parameters: 773M (708M without LM Head)