Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
23
47
44
Joya Chen
PRO
chenjoya
Follow
SamuraiBarbi's profile picture
SteveSHEN's profile picture
Aleniles's profile picture
22 followers
·
12 following
https://chenjoya.github.io/
chenjoya
AI & ML interests
Video LLM
Recent Activity
liked
a model
17 days ago
google/gemma-3n-E4B-it
upvoted
a
paper
18 days ago
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context
upvoted
a
paper
28 days ago
Show-o2: Improved Native Unified Multimodal Models
View all activity
Organizations
chenjoya
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
17 days ago
google/gemma-3n-E4B-it
Image-Text-to-Text
•
8B
•
Updated
5 days ago
•
284k
•
602
liked
a model
30 days ago
showlab/show-o2-7B
Any-to-Any
•
Updated
28 days ago
•
812
•
11
liked
2 models
about 2 months ago
pyannote/speaker-diarization-3.1
Automatic Speech Recognition
•
Updated
May 10, 2024
•
14.1M
•
980
nvidia/diar_sortformer_4spk-v1
Audio Classification
•
Updated
Feb 3
•
34.5k
•
67
liked
8 datasets
2 months ago
parler-tts/mls_eng
Viewer
•
Updated
Apr 9, 2024
•
10.8M
•
6.82k
•
26
edinburghcstr/ami
Viewer
•
Updated
Jan 16, 2023
•
110k
•
2.18k
•
58
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16, 2024
•
13M
•
31.5k
•
316
facebook/multilingual_librispeech
Viewer
•
Updated
Aug 12, 2024
•
1.49M
•
9.74k
•
141
facebook/voxpopuli
Updated
Oct 14, 2022
•
6.95k
•
125
CSTR-Edinburgh/vctk
Updated
Aug 14, 2024
•
412
•
43
MLCommons/peoples_speech
Viewer
•
Updated
Nov 20, 2024
•
8.05M
•
43.4k
•
127
MERaLiON/Multitask-National-Speech-Corpus-v1
Viewer
•
Updated
Jan 21
•
15.2M
•
22.6k
•
15
liked
2 models
2 months ago
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition
•
Updated
23 days ago
•
1.26M
•
1.23k
THUDM/glm-4-voice-tokenizer
0.4B
•
Updated
Oct 25, 2024
•
163k
•
10
liked
a Space
2 months ago
Running
on
Zero
14
14
FlexTok
🖼
FlexTok flexible sequence length autoencoding demo
liked
a model
2 months ago
moonshotai/Kimi-Audio-7B-Instruct
Text-to-Speech
•
10B
•
Updated
May 29
•
7.78k
•
•
331
liked
a Space
2 months ago
Running
137
137
Seed1.5 VL
🚀
Seed1.5-VL API Demo
liked
3 models
2 months ago
maitrix-org/Voila-Tokenizer
Audio-to-Audio
•
0.1B
•
Updated
May 6
•
5.75k
•
5
Qwen/Qwen2.5-Omni-3B
Any-to-Any
•
6B
•
Updated
Apr 30
•
167k
•
253
inclusionAI/Ming-Lite-Uni
Image-Text-to-Text
•
Updated
May 14
•
10
Load more