27 46 53

Steven Zheng

Steveeeeeeen

AI & ML interests

speech & audio

Recent Activity

updated a Space 3 days ago

hf-audio/open_asr_leaderboard

upvoted a paper 3 days ago

Training and Inference Efficiency of Encoder-Decoder Speech Models

liked a Space 3 days ago

hexgrad/Kokoro-TTS

View all activity

Organizations

Steveeeeeeen's activity

upvoted a paper 3 days ago

Training and Inference Efficiency of Encoder-Decoder Speech Models

Paper • 2503.05931 • Published 16 days ago • 2

upvoted a paper 6 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 17 days ago • 65

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 819

upvoted an article 16 days ago

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

16 days ago

• 43

upvoted an article 24 days ago

Article

SigLIP 2: A better multilingual vision language encoder

about 1 month ago

• 142

upvoted an article 25 days ago

Article

Deploying Speech-to-Speech on Hugging Face

Oct 22, 2024

• 39

upvoted 2 collections 25 days ago

OWLS: Scaling Laws for Speech Recognition and Translation

Collection

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 7 items • Updated 13 days ago • 4

Open Whisper-style Speech Models (OWSM)

Collection

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 15 items • Updated Feb 6 • 5

upvoted a paper 26 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 66

upvoted a paper about 1 month ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 166

upvoted an article about 1 month ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 164

upvoted a paper about 1 month ago

Presumed Cultural Identity: How Names Shape LLM Responses

Paper • 2502.11995 • Published Feb 17 • 10

upvoted an article about 1 month ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

Feb 18

• 94

upvoted a collection about 1 month ago

Feb 14 Releases 💌

Collection

23 items • Updated Feb 14 • 7

upvoted 3 articles about 1 month ago

Article

1 Billion Classifications

Feb 13

• 42

Article

Efficient Controllable Generation for SDXL with T2I-Adapters

Sep 8, 2023

• 7

Article

Introduction to the Open Leaderboard for Japanese LLMs

Nov 20, 2024

• 35