sha1779/BengaliRegionalASR_barishal_sylhet Automatic Speech Recognition โข Updated 11 days ago โข 15
view post Post 3358 VLMs are going through quite an open revolution AND on-device friendly sizes:1. Google DeepMind w/ PaliGemma2 - 3B, 10B & 28B: google/paligemma-2-release-67500e1e1dbfdd4dee27ba482. OpenGVLabs w/ InternVL 2.5 - 1B, 2B, 4B, 8B, 26B, 38B & 78B: https://huggingface.co/collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c3. Qwen w/ Qwen 2 VL - 2B, 7B & 72B: Qwen/qwen2-vl-66cee7455501d7126940800d4. Microsoft w/ FlorenceVL - 3B & 8B: https://huggingface.co/jiuhai5. Moondream2 w/ 0.5B: https://huggingface.co/vikhyatk/What a time to be alive! ๐ฅ See translation ๐ฅ 11 11 ๐ 4 4 + Reply