Generative Error Correction & Understanding Community

community

Activity Feed

AI & ML interests

LLM for Text-based Speech Processing

GenSEC-LLM's activity

huckiyang

updated a model 12 days ago

GenSEC-LLM/SLT-Task1-Llama2-7b-HyPo-baseline

Text Generation • Updated 12 days ago • 441

kunaldhawan

authored 3 papers about 2 months ago

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

Paper • 2310.12378 • Published Oct 18, 2023

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer

Paper • 2306.08753 • Published Jun 14, 2023 • 1

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Paper • 2309.05248 • Published Sep 11, 2023

huckiyang

updated a Space 2 months ago

README

YC-Li

updated a dataset 6 months ago

GenSEC-LLM/SLT-Task3-Post-ASR-Emotion-Recognition

Updated Jun 15 • 7 • 1

Taejin

updated a Space 6 months ago

SLT GenSEC Task2 Speaker Tagging Leaderboard

Taejin

updated a dataset 6 months ago

GenSEC-LLM/SLT-Task2-Post-ASR-Speaker-Tagging

Viewer • Updated Jun 11 • 56.3k • 49 • 1

Taejin

updated a model 7 months ago

GenSEC-LLM/SLT-Task2-ngram-baseline

Updated May 13

huckiyang

updated a dataset 8 months ago

GenSEC-LLM/SLT-Task1-Post-ASR-Text-Correction

Viewer • Updated Apr 29 • 257k • 50 • 1

huckiyang

authored 4 papers about 1 year ago

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

Paper • 2309.15701 • Published Sep 27, 2023 • 2

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Paper • 2310.06434 • Published Oct 10, 2023 • 4

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 19

Voice2Series: Reprogramming Acoustic Models for Time Series Classification

Paper • 2106.09296 • Published Jun 17, 2021

yuangongfdu

authored 2 papers over 1 year ago

AST: Audio Spectrogram Transformer

Paper • 2104.01778 • Published Apr 5, 2021 • 2

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Paper • 2307.03183 • Published Jul 6, 2023 • 10

Team members 7
