Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
687
2
41
Sanchit Gandhi
sanchit-gandhi
Follow
florentgbelidji's profile picture
lnyhrd's profile picture
21world's profile picture
390 followers
·
14 following
sanchitgandhi99
sanchit-gandhi
AI & ML interests
Open-Source Speech
Articles
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
Feb 27
•
19
Speculative Decoding for 2x Faster Whisper Inference
Dec 20, 2023
•
12
AudioLDM 2, but faster ⚡️
Aug 30, 2023
•
1
A Complete Guide to Audio Datasets
Dec 15, 2022
•
6
Fine-Tune Whisper with 🤗 Transformers
Nov 3, 2022
•
38
Organizations
sanchit-gandhi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
openai/whisper-large-v3
2 days ago
Update README.md
2
#126 opened 2 days ago by
reach-vb
New activity in
distil-whisper/distil-large-v3
2 days ago
Update README.md
1
#5 opened 2 days ago by
reach-vb
New activity in
distil-whisper/distil-medium.en
2 days ago
Just can't run!
7
#14 opened 3 months ago by
awesomeandy
New activity in
facebook/mms-tts-eng
2 days ago
Create preprocessor_config.json
2
#12 opened 4 days ago by
adityaedy01
New activity in
facebook/mms-tts-som
3 days ago
where is the preprocessor_config.json for this model?
1
#1 opened 4 days ago by
adityaedy01
New activity in
facebook/wav2vec2-xls-r-1b-21-to-en
4 days ago
Incorrect config file
4
#5 opened 3 months ago by
shrey-jasuja
Update Example Code Snippets
#6 opened 4 days ago by
sanchit-gandhi
New activity in
Aspik101/distil-whisper-large-v3-pl
4 days ago
Model Discussion
7
#2 opened 5 months ago by
sanchit-gandhi
New activity in
facebook/wav2vec2-large-960h-lv60-self
18 days ago
facing issues while using access token of the following model facebook/wav2vec2-large-960h-lv60-self
1
#8 opened 23 days ago by
Webster9
New activity in
openai/whisper-large-v3
18 days ago
KeyError: 'whisper'
1
#116 opened 18 days ago by
aiyaqingzheng
New activity in
parler-tts/parler-tts-mini-expresso
19 days ago
What to use for [train] ? pip install -e .[train]
2
#2 opened 19 days ago by
Kimsui
New activity in
openai/whisper-large-v3
23 days ago
how to transcribe hundreds of local audio files once?
1
#114 opened 24 days ago by
myspace-ai
New activity in
sweet-dreambooths/musicgen-songstarter-v0.2-hf
about 1 month ago
Upload processor
#2 opened about 1 month ago by
sanchit-gandhi
Upload MusicgenMelodyForConditionalGeneration
#1 opened about 1 month ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
about 1 month ago
Error loading dataset
2
#9 opened about 2 months ago by
jorgetebl
New activity in
LIUM/tedlium
about 1 month ago
FileNotFoundError when loading the LIUM/tedlium data on Windows
4
#4 opened 3 months ago by
wondav
New activity in
sanchit-gandhi/musicgen-streaming
about 2 months ago
Song doesn't appear to play (regardless of any browser)
3
#5 opened about 2 months ago by
Nothsa
New activity in
openai/whisper-large-v3
about 2 months ago
How to get accuracy of transcription from the model?
5
#98 opened 2 months ago by
Atulad
How we can use this model to achieve a real-time trans?
4
#99 opened 2 months ago by
Von-violet
New activity in
parler-tts/parler_tts_mini
about 2 months ago
Fixed . on a different line.
1
#2 opened about 2 months ago by
blaise-tk
minor ui fix
1
#4 opened about 2 months ago by
mrfakename
New activity in
parler-tts/parler_tts_mini_v0.1
about 2 months ago
Inference speed
6
#2 opened about 2 months ago by
andreasrath
Link model to the training datasets in metadata
1
#3 opened about 2 months ago by
julien-c
Add training datasets to metadata
1
#5 opened about 2 months ago by
sanchit-gandhi
Update README.md
#4 opened about 2 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
2 months ago
Update alignment heads in gen config
#3 opened 2 months ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
2 months ago
LICENSE question
2
#8 opened 3 months ago by
phoneme
New activity in
sanchit-gandhi/musicgen-streaming
2 months ago
Streaming doesn't work yet with gradio 4.0
#4 opened 2 months ago by
ylacombe
New activity in
distil-whisper/distil-large-v3
2 months ago
about multiple languages?
2
#2 opened 3 months ago by
obtion
New activity in
sanchit-gandhi/whisper-small-hi
3 months ago
Adding `safetensors` variant of this model
#17 opened 7 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-lv-60-espeak-cv-ft
3 months ago
Adding `safetensors` variant of this model
1
#4 opened 7 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-large-xlsr-53
3 months ago
Adding `safetensors` variant of this model
1
#3 opened 3 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-base
3 months ago
Adding `safetensors` variant of this model
1
#2 opened 6 months ago by
SFconvertbot
New activity in
distil-whisper/distil-large-v3-ct2
3 months ago
Update README.md
3
#2 opened 3 months ago by
muhtasham
New activity in
distil-whisper/distil-large-v3-ggml
3 months ago
is it fp16?
3
#1 opened 3 months ago by
supercharge19
New activity in
distil-whisper/distil-large-v3-ct2
3 months ago
Update alignment heads
#1 opened 3 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
3 months ago
How to do multilingual transcription?
3
#1 opened 3 months ago by
emraza110
New activity in
facebook/mms-tts-tao
3 months ago
Reference of the Dataset
1
#1 opened 3 months ago by
ChiaLingWeng
New activity in
openai/whisper-large-v3
3 months ago
How to save the loss value for each step during the training process?
2
#91 opened 3 months ago by
zhouwen999
New activity in
hf-audio/open_asr_leaderboard
3 months ago
[Average WER Calculation] Drop Common Voice WER.
4
#14 opened 3 months ago by
reach-vb
New activity in
openai/whisper-large-v3
3 months ago
Transcript an Spanish audio
4
#86 opened 3 months ago by
Andrews99
New activity in
sanchit-gandhi/whisper-medium-fleurs-lang-id
3 months ago
How do you fine tune Whisper for classification task rather than transcription?
6
#1 opened about 1 year ago by
nkburns
New activity in
openai/whisper-large-v2
3 months ago
Add missing merge to tokenizer
#100 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-large
3 months ago
Add missing merge to tokenizer
#50 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-medium
3 months ago
Add missing merge to tokenizer
#36 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-small
3 months ago
Add missing merge to tokenizer
#38 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-tiny
3 months ago
Add missing merge to tokenizer
#40 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-base
3 months ago
Upload tokenizer
2
#28 opened 6 months ago by
ArthurZ
New activity in
sanchit-gandhi/large-v3-32-2-conditioned-prompt-logic-timestamped-resumed-pt
3 months ago
Update generation_config.json
#2 opened 3 months ago by
sanchit-gandhi
Update generation_config.json
#1 opened 3 months ago by
sanchit-gandhi
New activity in
facebook/s2t-wav2vec2-large-en-de
3 months ago
Updates incorrect tokenizer configuration file
1
#3 opened 4 months ago by
lysandre
New activity in
kakao-enterprise/vits-vctk
4 months ago
List of all available speakers?
2
#2 opened 4 months ago by
Nikerino
New activity in
facebook/mms-tts-eng
4 months ago
What kind of dataset was used?
1
#8 opened 4 months ago by
f0rGoTTen000
New activity in
distil-whisper/whisper-vs-distil-whisper
4 months ago
Distil version does a bad job at Transcribing
3
#2 opened 4 months ago by
arslankas
New activity in
google/gemma-7b-it
4 months ago
error model.generate()
14
#13 opened 4 months ago by
NickyNicky
New activity in
facebook/musicgen-melody
4 months ago
Upload MusicgenMelodyForConditionalGeneration
#8 opened 4 months ago by
ylacombe
Upload processor
#9 opened 4 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody
4 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 4 months ago by
ylacombe
Upload processor
#3 opened 4 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody-large
4 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 4 months ago by
ylacombe
Load more