Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
672
1
41
Sanchit Gandhi
sanchit-gandhi
Follow
sthennarasan's profile picture
snolyai's profile picture
BrigitteTousi's profile picture
341 followers
·
13 following
sanchitgandhi99
sanchit-gandhi
AI & ML interests
Open-Source Speech
Articles
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
Feb 27
•
10
Speculative Decoding for 2x Faster Whisper Inference
Dec 20, 2023
•
3
AudioLDM 2, but faster ⚡️
Aug 30, 2023
•
1
A Complete Guide to Audio Datasets
Dec 15, 2022
Fine-Tune Whisper with 🤗 Transformers
Nov 3, 2022
•
8
Organizations
sanchit-gandhi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
sanchit-gandhi/musicgen-streaming
4 days ago
Song doesn't appear to play (regardless of any browser)
3
#5 opened 11 days ago by
Nothsa
New activity in
openai/whisper-large-v3
4 days ago
How to get accuracy of transcription from the model?
5
#98 opened 28 days ago by
Atulad
How we can use this model to achieve a real-time trans?
4
#99 opened 17 days ago by
Von-violet
New activity in
parler-tts/parler_tts_mini
9 days ago
Fixed . on a different line.
1
#2 opened 13 days ago by
blaise-tk
New activity in
parler-tts/parler_tts_mini
11 days ago
minor ui fix
1
#4 opened 13 days ago by
mrfakename
New activity in
parler-tts/parler_tts_mini_v0.1
13 days ago
Inference speed
3
#2 opened 13 days ago by
andreasrath
Link model to the training datasets in metadata
1
#3 opened 13 days ago by
julien-c
Add training datasets to metadata
1
#5 opened 13 days ago by
sanchit-gandhi
Update README.md
#4 opened 13 days ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
20 days ago
Update alignment heads in gen config
#3 opened 20 days ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
21 days ago
LICENSE question
2
#8 opened about 1 month ago by
phoneme
New activity in
sanchit-gandhi/musicgen-streaming
21 days ago
Streaming doesn't work yet with gradio 4.0
#4 opened 21 days ago by
ylacombe
New activity in
distil-whisper/distil-large-v3
28 days ago
about multiple languages?
2
#2 opened about 1 month ago by
obtion
New activity in
sanchit-gandhi/whisper-small-hi
29 days ago
Adding `safetensors` variant of this model
#17 opened 5 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-lv-60-espeak-cv-ft
29 days ago
Adding `safetensors` variant of this model
1
#4 opened 5 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-large-xlsr-53
29 days ago
Adding `safetensors` variant of this model
1
#3 opened about 2 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-base
29 days ago
Adding `safetensors` variant of this model
1
#2 opened 4 months ago by
SFconvertbot
New activity in
distil-whisper/distil-large-v3-ct2
29 days ago
Update README.md
3
#2 opened about 1 month ago by
muhtasham
New activity in
distil-whisper/distil-large-v3-ggml
29 days ago
is it fp16?
3
#1 opened 30 days ago by
supercharge19
New activity in
distil-whisper/distil-medium.en
29 days ago
Just can't run!
3
#14 opened about 1 month ago by
awesomeandy
New activity in
distil-whisper/distil-large-v3-ct2
about 1 month ago
Update alignment heads
#1 opened about 1 month ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
about 1 month ago
How to do multilingual transcription?
3
#1 opened about 1 month ago by
emraza110
New activity in
facebook/mms-tts-tao
about 1 month ago
Reference of the Dataset
1
#1 opened about 1 month ago by
ChiaLingWeng
New activity in
openai/whisper-large-v3
about 1 month ago
How to save the loss value for each step during the training process?
2
#91 opened about 1 month ago by
zhouwen999
New activity in
hf-audio/open_asr_leaderboard
about 1 month ago
[Average WER Calculation] Drop Common Voice WER.
4
#14 opened about 2 months ago by
reach-vb
New activity in
openai/whisper-large-v3
about 2 months ago
Transcript an Spanish audio
3
#86 opened about 2 months ago by
Andrews99
New activity in
sanchit-gandhi/whisper-medium-fleurs-lang-id
about 2 months ago
How do you fine tune Whisper for classification task rather than transcription?
6
#1 opened about 1 year ago by
nkburns
New activity in
openai/whisper-large-v2
about 2 months ago
Add missing merge to tokenizer
#100 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-large
about 2 months ago
Add missing merge to tokenizer
#50 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-medium
about 2 months ago
Add missing merge to tokenizer
#36 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-small
about 2 months ago
Add missing merge to tokenizer
#38 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-tiny
about 2 months ago
Add missing merge to tokenizer
#40 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-base
about 2 months ago
Upload tokenizer
2
#28 opened 4 months ago by
ArthurZ
New activity in
sanchit-gandhi/large-v3-32-2-conditioned-prompt-logic-timestamped-resumed-pt
about 2 months ago
Update generation_config.json
#2 opened about 2 months ago by
sanchit-gandhi
Update generation_config.json
#1 opened about 2 months ago by
sanchit-gandhi
New activity in
facebook/s2t-wav2vec2-large-en-de
about 2 months ago
Updates incorrect tokenizer configuration file
1
#3 opened 2 months ago by
lysandre
New activity in
kakao-enterprise/vits-vctk
2 months ago
List of all available speakers?
2
#2 opened 2 months ago by
Nikerino
New activity in
facebook/mms-tts-eng
2 months ago
What kind of dataset was used?
1
#8 opened 2 months ago by
f0rGoTTen000
New activity in
distil-whisper/whisper-vs-distil-whisper
2 months ago
Distil version does a bad job at Transcribing
3
#2 opened 2 months ago by
arslankas
New activity in
google/gemma-7b-it
2 months ago
error model.generate()
14
#13 opened 2 months ago by
NickyNicky
New activity in
facebook/musicgen-melody
2 months ago
Upload MusicgenMelodyForConditionalGeneration
#8 opened 2 months ago by
ylacombe
Upload processor
#9 opened 2 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody
2 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 2 months ago by
ylacombe
Upload processor
#3 opened 2 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody-large
2 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 2 months ago by
ylacombe
Upload processor
#3 opened 2 months ago by
ylacombe
New activity in
facebook/musicgen-melody-large
2 months ago
Upload MusicgenMelodyForConditionalGeneration
#3 opened 2 months ago by
ylacombe
Upload processor
1
#4 opened 2 months ago by
ylacombe
New activity in
google/gemma-7b
2 months ago
Upload FlaxGemmaForCausalLM
1
#3 opened 2 months ago by
pcuenq
New activity in
facebook/mms-tts-tam
2 months ago
AttributeError
1
#1 opened 2 months ago by
murthy1998
Fix code examples for transformers
#2 opened 2 months ago by
sanchit-gandhi
New activity in
hf-audio/open_asr_leaderboard
2 months ago
Smaller model sizes lead to worse RTF on Whisper
2
#8 opened 3 months ago by
lorenzopark
Define RTF
#12 opened 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-large-v3
3 months ago
Update forced decoder ids
#79 opened 3 months ago by
sanchit-gandhi
model in closed network
3
#78 opened 3 months ago by
iamwhoiamm
New activity in
sanchit-gandhi/large-v3-32-2-token-ids-freeze-embeds-label-length-448-unshuffled-filtered-conditioned-pt
3 months ago
Update generation_config.json
#1 opened 3 months ago by
sanchit-gandhi
New activity in
facebook/seamless-m4t-v2-large
3 months ago
Loading the model takes a long time using from_pretrained
1
#29 opened 3 months ago by
Zhaoz1997
New activity in
openai/whisper-large-v2
3 months ago
How to finetune on Kaggle TPU
8
#89 opened 5 months ago by
LukeJacob2023
New activity in
carlosdanielhernandezmena/ravnursson_asr
3 months ago
Dataset Viewer issue: RowsPostProcessingError
3
#2 opened 3 months ago by
carlosdanielhernandezmena
New activity in
sanchit-gandhi/whisper-jax
3 months ago
Error while transcribing and translating via Youtube Link
18
#69 opened 5 months ago by
deependraparmar
Load more