For speculative decoding, or chat models themselves?

by FM-1976 - opened

Hi afrideva, big fan of yours.
I saw many super tiny language models in your repo.
Are they meant for speculative decoding, or can we consider them chat models in their own right?

I am running my own tests in the meantime,
but I would like to write a Medium article about sub-500M-parameter models.
