Article
SmolVLM2: Bringing Video Understanding to Every Device
โข
134
I works flawlessly on hugging face, while trying to run in local the listening and processing events are fluctuating almost 60 times or more every minute, making the api non usable.