1 3 19

Justus Tobias PRO

justus-tobias

https://justus-tobias.de

j-tobias

AI & ML interests

Multimodal Learning, Representation Learning, Audio Processing

Recent Activity

liked a Space 23 days ago

smolagents/smolagents-leaderboard

liked a model about 2 months ago

Zyphra/Zonos-v0.1-hybrid

liked a model about 2 months ago

Qwen/Qwen2.5-VL-7B-Instruct

View all activity

Organizations

None yet

justus-tobias's activity

liked a Space 23 days ago

117

smolagents LLM leaderboard

🏆

A leaderboard for LLMs powering smolagents

liked 2 models about 2 months ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated Feb 15 • 14.7k • 1.06k

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 12 days ago • 3.34M • • 783

liked a model 2 months ago

THUDM/CogVideoX1.5-5B-I2V

Image-to-Video • Updated 17 days ago • 9.02k • 96

updated a Space 3 months ago

Heartbeat

💜

upvoted a paper 4 months ago

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs

Paper • 2411.02256 • Published Nov 4, 2024 • 1

liked a model 4 months ago

tencent/HunyuanVideo

Text-to-Video • Updated 28 days ago • 3.09k • • 1.8k

upvoted a paper 4 months ago

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Paper • 2311.15308 • Published Nov 26, 2023 • 2

liked a Space 4 months ago

Gradio Demo Space creation helper V2

🐶

Generate Gradio demo files for Hugging Face model repos

updated a Space 6 months ago

Moshi

💨

Create interactive spoken dialogue using audio input

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 112

liked a Space 7 months ago

708

Open ASR Leaderboard

🏆

Request and view assessments for speech recognition models

liked a Space 8 months ago

952

Seamless M4T

📞

updated a dataset 8 months ago

justus-tobias/TestDataset

Updated Aug 15, 2024 • 4

liked a Space 8 months ago

gradio_pdf V0.10.0

🚀

Ask questions about PDF documents

liked a model 8 months ago

facebook/wav2vec2-base-960h

Automatic Speech Recognition • Updated Nov 14, 2022 • 3.57M • • 327

liked 2 datasets 8 months ago

openslr/librispeech_asr

Updated Aug 14, 2024 • 13.9k • 141

MLCommons/peoples_speech

Viewer • Updated Nov 20, 2024 • 8.05M • 33.6k • 101

liked a Space 9 months ago

305

AudioLDM2 Text2Audio Text2Music Generation

🔊

Generate audio and waveform video from text

liked a Space 10 months ago

135

Exbert

🌍

Explore BERT model interactions