64 28 41

Johannes Kolbe PRO

johko

johko

AI & ML interests

None yet

Recent Activity

reacted to AdinaY's post with 🚀 about 13 hours ago

SkyReels-V2 🔥 UNLIMITED duration video generation model by Kunlun Tech-Skywork 天工 Paper: https://huggingface.co/papers/2504.13074 Model: https://huggingface.co/collections/Skywork/skyreels-v2-6801b1b93df627d441d0d0d9 ✨ 1.3B & 14B ✨ Generates infinite length videos using Diffusion Forcing with diffusion models + autoregressive methods

updated a Space 5 days ago

johko/NSQL-Text-To-SQL

upvoted a paper 6 days ago

MIEB: Massive Image Embedding Benchmark

View all activity

Organizations

johko's activity

upvoted a paper 6 days ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published 8 days ago • 15

upvoted an article 3 months ago

Article

State of open video generation models in Diffusers

Jan 27

• 51

upvoted a paper 7 months ago

HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning

Paper • 2407.15680 • Published Jul 22, 2024 • 1

upvoted an article 8 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 87

upvoted a paper 8 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 130

upvoted a paper 9 months ago

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 43

upvoted a collection about 1 year ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 75 items • Updated 2 days ago • 87

upvoted a paper about 1 year ago

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Paper • 2403.10516 • Published Mar 15, 2024 • 16

upvoted a collection about 1 year ago

Matryoshka Embedding Models

Collection

https://huggingface.co/blog/matryoshka • 14 items • Updated 28 days ago • 16

upvoted 3 papers about 1 year ago

How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts

Paper • 2402.13220 • Published Feb 20, 2024 • 15

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 32

PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2, 2024 • 32

upvoted a collection about 1 year ago

AIM

Collection

AIM: Autoregressive Image Models • 5 items • Updated Oct 29, 2024 • 49

upvoted 5 papers over 1 year ago