giboulot's picture
9 35

giboulot

danypropsy
·

AI & ML interests

None yet

Recent Activity

liked a model about 18 hours ago
AIDC-AI/Marco-o1
liked a Space about 20 hours ago
llamameta/llama3.1-405B
upvoted a collection about 20 hours ago
Gemma 3
View all activity

Organizations

None yet

danypropsy's activity

reacted to prithivMLmods's post with 🤗 about 20 hours ago
view post
Post
1674
Gemma-3-4B : Image and Video Inference 🖼️🎥

🧤Space: prithivMLmods/Gemma-3-Multimodal

@gemma3-4b : {Tag + Space_+ 'prompt'}
@video-infer : {Tag + Space_+ 'prompt'}
By default, it runs: prithivMLmods/Qwen2-VL-OCR-2B-Instruct

Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf

Additionally, I have also tested Aya-Vision 8B vs Custom Qwen2-VL-OCR for OCR with test case samples on messy handwriting for experimental purposes to optimize edge device VLMs for Optical Character Recognition.

📜Read the blog here: https://huggingface.co/blog/prithivMLmods/aya-vision-vs-qwen2vl-ocr-2b
  • 1 reply
·
updated a collection about 2 months ago
liked a Space 2 months ago