atlasia/DODa-audio-dataset
Viewer • Updated • 12.7k • 329 • 20
How to use Hafsa0/whisper-large-v3-darija-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
base_model = AutoModelForSeq2SeqLM.from_pretrained("openai/whisper-large-v3")
model = PeftModel.from_pretrained(base_model, "Hafsa0/whisper-large-v3-darija-lora")Modèle Whisper large-v3 fine-tuné avec LoRA sur le dataset DODA pour la transcription automatique du dialecte marocain (Darija).
from transformers import WhisperForConditionalGeneration, WhisperProcessor
from peft import PeftModel
import torch
base = WhisperForConditionalGeneration.from_pretrained(
"openai/whisper-large-v3",
torch_dtype=torch.float16,
device_map="auto"
)
model = PeftModel.from_pretrained(base, "Hafsa0/whisper-large-v3-darija-lora")
processor = WhisperProcessor.from_pretrained("Hafsa0/whisper-large-v3-darija-lora")
# Transcription
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
ids = model.generate(inputs.input_features.half().to("cuda"),
language="arabic", task="transcribe")
print(processor.tokenizer.decode(ids[0], skip_special_tokens=True))
Base model
openai/whisper-large-v3