intfloat/mmE5-mllama-11b-instruct Zero-Shot Image Classification • Updated 20 days ago • 5.66k • 17
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 23 days ago • 400
Post • we have a leaderboard for video LLMs, and most of the top models are open ones! opencompass/openvlm_video_leaderboard we are so back 🔥🔥
BRAVE: Broadening the visual encoding of vision-language models Paper • 2404.07204 • Published Apr 10, 2024 • 19
Post • huggingface, SakanaAILabs and @arcee_ai are sponsoring a Model Merging Competition with really sweet 💰cash prizes💰 at the 2024 NeurIPSConf! (https://neurips.cc) Submissions are now open and will remain open until September 2024. Register here: https://llm-merging.github.io/ Join the Discord discussion: https://discord.com/invite/dPBHEVnV