view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago • 241
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 8 hours ago • 32
MT Quality Estimation Collection Models for reference-free quality estimation of machine translation • 10 items • Updated Jan 29 • 2
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 25
OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 7 items • Updated 4 days ago • 4
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • Feb 11 • 26
NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 28 days ago • 16
Ukrainian Text-to-Speech datasets Collection Five voices: Mykyta, Oleksa, Lada, Kateryna or Tetiana • 6 items • Updated 16 days ago • 4
Crimean Tatar Text-to-Speech datasets Collection Three voices: Abibullah, Sevil, or Arslan • 4 items • Updated 16 days ago • 2
Setting up the Data Printer with Improved English to Ukrainian Machine Translation Paper • 2404.15196 • Published Apr 23, 2024 • 1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition Paper • 2310.06434 • Published Oct 10, 2023 • 4