VRAM usage
I used the demo - the model downloaded and inference works really nicely, even in other languages. The only thing that surprises me is that VRAM usage on the GPU keeps increasing even though the microphone isn't on.
I also ran into a problem where the model, after showing the correct text for what I just said, suddenly starts hallucinating and reduces the output to e.g. [laughs].
Could you give me the code to run inference with the model? ....
I just used this demo - Xenova/realtime-whisper-webgpu - and over time the GPU's VRAM filled up to almost 100% within 2 minutes.
Thanks bro, and is it possible to run inference on the model using transformers code? If you know the answer, please tell me.
I need this code ....
Here is the code for anyone who needs it:
https://github.com/xenova/transformers.js/tree/v3/examples/webgpu-whisper
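For anyone who just wants a quick script rather than the full example app, a minimal transformers.js sketch along these lines should work. Note that the package name, model ID, and the `device: 'webgpu'` option below follow the transformers.js v3 docs and are assumptions on my part, not something taken from this thread; adjust them to your setup.

```js
// Minimal sketch: speech-to-text with transformers.js v3 (assumed API).
import { pipeline } from '@huggingface/transformers';

// Create an automatic-speech-recognition pipeline.
// 'onnx-community/whisper-base' is an assumed model ID; 'webgpu' falls back
// to WASM if the browser/runtime has no WebGPU support.
const transcriber = await pipeline(
  'automatic-speech-recognition',
  'onnx-community/whisper-base',
  { device: 'webgpu' },
);

// Input can be an audio URL or a Float32Array of 16 kHz mono samples.
const url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/jfk.wav';
const output = await transcriber(url);

console.log(output.text);
```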
Hi again! The VRAM issue is now fixed!