Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

liked a dataset 7 days ago

HuggingFaceFV/longvideos2

andito's activity

commented 2 papers 16 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 16 days ago • 170

Slow-Fast Architecture for Video Multi-Modal Large Language Models

Paper • 2504.01328 • Published 22 days ago • 8

New activity in HuggingFaceTB/SmolVLM-Instruct 20 days ago

How many parameters are there in the model?

#26 opened 3 months ago by

commented 2 papers 21 days ago

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published 23 days ago • 35

commented 2 papers about 1 month ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 97

New activity in HuggingFaceTB/SmolVLM-256M-Instruct 3 months ago

Add ONNX sample code

#8 opened 3 months ago by

Upload photo_2025-01-25_13-45-22.jpg

#5 opened 3 months ago by

There is an issue with AutoProcessor

#6 opened 3 months ago by

New activity in HuggingFaceTB/SmolVLM-500M-Instruct 3 months ago

Upload ONNX weights

#1 opened 3 months ago by

New activity in HuggingFaceTB/SmolVLM-256M-Instruct 3 months ago

[WIP] Upload ONNX weights

#1 opened 3 months ago by

New activity in HuggingFaceM4/Idefics3-8B-Llama3 5 months ago

Remove PR message

#19 opened 6 months ago by

New activity in HuggingFaceTB/SmolVLM-Instruct 5 months ago

GGUF format?

#12 opened 5 months ago by

Upload ONNX weights + chat template fixes

#13 opened 5 months ago by

New activity in HuggingFaceTB/SmolVLM 5 months ago

Update app.py

#3 opened 5 months ago by

New activity in HuggingFaceTB/SmolVLM-Instruct 5 months ago

Best option for DocQVA->JSON

#11 opened 5 months ago by

ValueError: `resolution_max_side` cannot be larger than `max_image_size` with N=5

#9 opened 5 months ago by

loading images locally?

#8 opened 5 months ago by

Will this work with vLLM?

#10 opened 5 months ago by