Problems with timestamps

by jalberth - opened 3 days ago

3 days ago

I can't get return_timestamps=True to save the transcription with timestamps. I'm using the simple code from the model card and saving the res variable as JSON.

d93

3 days ago

•

edited 3 days ago

pipe = pipeline(
"automatic-speech-recognition",
model=model,
tokenizer=processor.tokenizer,
feature_extractor=processor.feature_extractor,
torch_dtype=torch_dtype,
device=device,
return_timestamps=True,
)

I tried with a short audio.mp3, so I had to decrease chunk_length_s to 10 in the call to pipe to get it to work.

jalberth

about 13 hours ago

Perfect, now I got it to work. I wrongly put the argument in the generate_kwargs.

thx for the clarification.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment