Smoliakov PRO

Yehor

https://t.me/doing_something

AI & ML interests

Speech-to-Text, Text-to-Speech, Voice over Internet Protocol

Recent Activity

liked a Space about 1 hour ago

alexakup05/eye

published a Space 1 day ago

Yehor/question-classifier-demo

updated a Space 1 day ago

Yehor/question-classifier-demo

Yehor's activity

upvoted an article 2 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

3 days ago • 241

upvoted a collection 2 days ago

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 8 hours ago • 32

upvoted 2 collections 4 days ago

MT Quality Estimation

Models for reference-free quality estimation of machine translation • 10 items • Updated Jan 29 • 2

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 25

upvoted 2 collections 16 days ago

Ukrainian Speech-to-Text models

4 items • Updated 15 days ago • 1

OWLS: Scaling Laws for Speech Recognition and Translation

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 7 items • Updated 4 days ago • 4

upvoted an article 17 days ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By … and 1 other • Feb 11 • 26

upvoted 2 collections 17 days ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 28 days ago • 16

Ukrainian Text-to-Speech datasets

Five voices: Mykyta, Oleksa, Lada, Kateryna or Tetiana • 6 items • Updated 16 days ago • 4

upvoted a collection 18 days ago

Crimean Tatar Text-to-Speech datasets

Three voices: Abibullah, Sevil, or Arslan • 4 items • Updated 16 days ago • 2

upvoted a paper about 2 months ago

Setting up the Data Printer with Improved English to Ukrainian Machine Translation

Paper • 2404.15196 • Published Apr 23, 2024 • 1

upvoted 3 papers over 1 year ago

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Paper • 2310.06434 • Published Oct 10, 2023 • 4

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 7

AudioSR: Versatile Audio Super-resolution at Scale

Paper • 2309.07314 • Published Sep 13, 2023 • 27