Merve Noyan

mervenoyan

AI & ML interests

Natural language understanding, chatbots, information extraction

Recent Activity

new activity 1 day ago

nvidia/DAM-3B-Video:Add license

new activity 1 day ago

nvidia/DAM-3B:Add license

new activity 1 day ago

nvidia/DAM-3B:Code snippet to use the model

mervenoyan's activity

New activity in nvidia/DAM-3B-Video 1 day ago

Add license

#1 opened 1 day ago by

mervenoyan

New activity in nvidia/DAM-3B 1 day ago

Add license

#2 opened 1 day ago by

mervenoyan

Code snippet to use the model

#1 opened 1 day ago by

mervenoyan

New activity in nvidia/DAM-3B-Self-Contained 1 day ago

License

#1 opened 1 day ago by

merve

posted an update 9 days ago

Post

526

Why do people sleep on DSE multimodal retrieval models? 👀

They're just like ColPali, but highly scalable and fast, and you can make them even more efficient with binarization or Matryoshka embeddings, with little degradation 🪆

I made a small collection of them so you can get started: merve/multimodal-dse-retrievers-67fe71a9c8f1ad26a48859c3

Image taken from MCDSE blog https://huggingface.co/blog/marco/announcing-mcdse-2b-v1
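
To make the binarization and Matryoshka remark concrete, here is a minimal sketch of both tricks applied to single-vector embeddings. Everything in it is an assumption for illustration: the embeddings are random NumPy arrays standing in for real model outputs, the sizes (1536 and 256) are arbitrary, and the helper names `truncate_matryoshka`, `binarize` and `hamming_distance` are hypothetical, not part of any DSE model's API. Truncation only preserves quality when the model was actually trained with a Matryoshka-style objective.

```python
import numpy as np

def truncate_matryoshka(embeddings, dim):
    """Keep the first `dim` dimensions and re-normalize to unit length."""
    truncated = embeddings[:, :dim]
    norms = np.linalg.norm(truncated, axis=1, keepdims=True)
    return truncated / np.clip(norms, 1e-12, None)

def binarize(embeddings):
    """Threshold each dimension at zero and pack 8 bits into one byte."""
    bits = (embeddings > 0).astype(np.uint8)
    return np.packbits(bits, axis=1)

def hamming_distance(query_code, doc_codes):
    """Count differing bits between one packed query and each packed document."""
    xor = np.bitwise_xor(doc_codes, query_code)
    return np.unpackbits(xor, axis=1).sum(axis=1)

# Stand-in data: 4 "document" embeddings and a query that is a noisy copy of document 2.
rng = np.random.default_rng(0)
doc_embeddings = rng.standard_normal((4, 1536)).astype(np.float32)
noise = 0.1 * rng.standard_normal((1, 1536)).astype(np.float32)
query_embedding = doc_embeddings[2:3] + noise

# Matryoshka step: 1536 floats per vector become 256 floats.
small_docs = truncate_matryoshka(doc_embeddings, 256)
small_query = truncate_matryoshka(query_embedding, 256)

# Binarization step: 256 floats become 256 bits, stored as 32 bytes.
doc_codes = binarize(small_docs)
query_code = binarize(small_query)

# Retrieval: the document with the smallest Hamming distance wins.
best = int(np.argmin(hamming_distance(query_code, doc_codes)))
print(doc_codes.shape, best)
```

In this toy setup each document shrinks from 6144 bytes (1536 float32 values) to 32 bytes, and search reduces to XOR plus bit counting. A common follow-up, not shown here, is to rescore the top binary hits with the full-precision vectors.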

authored a paper 16 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 17 days ago • 171

liked a model about 2 months ago

intfloat/mmE5-mllama-11b-instruct

Zero-Shot Image Classification • Updated Feb 27 • 718 • 18

upvoted a collection 3 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated 24 days ago • 448

liked a model 5 months ago

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 1.35k • 515

posted an update 6 months ago

Post

2356

we have a leaderboard for video LLMs, and most of the top models are open ones! opencompass/openvlm_video_leaderboard 👑👏
we are so back 🔥

liked a model 7 months ago

nvidia/NVLM-D-72B

Image-Text-to-Text • Updated Jan 14 • 14.9k • 769

upvoted 2 papers 7 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

BRAVE: Broadening the visual encoding of vision-language models

Paper • 2404.07204 • Published Apr 10, 2024 • 19

reacted to Taylor658's post with 🤗 11 months ago

Post

2029

huggingface, SakanaAILabs and @arcee_ai are sponsoring a Model Merging Competition with really sweet 💰cash prizes💰 at the 2024 NeurIPSConf! (https://neurips.cc) 🎉

Submissions are now open and will remain open until September 2024. 🚀

🔗 Register here: https://llm-merging.github.io/
🗣️ Join the Discord discussion: https://discord.com/invite/dPBHEVnV

1 reply

liked 3 models 11 months ago

liked 2 Spaces over 1 year ago

282

Aitube2

🚀

Explore AI-generated videos in 2025

Llava

🏢

Chat with LLaVA using images and text