speculative or chat model itself?
#1
by
FM-1976
- opened
ciao afrideva, big fan of yours.
I saw many super tiny language models in your repo.
Are them for speculative decoding, or can we consider them as chat model themselves?
I am running my tests meantime,
but I would like to write a medium article about sub 500M parameter models.
thanks