metadata
license: mit
language:
  - as
base_model:
  - openai/whisper-medium

Assamese Dialect Classification Model

This repository contains a trained model that classifies Assamese dialects from speech input. It was developed to help identify and study regional variation in the Assamese language.


Model Purpose

The model assigns an Assamese speech sample to one of the dialects listed below. It is intended for linguistic research, speech analysis, and dialect-aware applications in natural language processing (NLP) and automatic speech recognition (ASR).


Dialects Recognized

The model is trained to recognize the following four dialects of the Assamese language:

  1. Darangia
  2. Kamrupia
  3. Nalbaria
  4. Upper Assam
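
The four classes above can be mapped to integer ids when post-processing model outputs. The following is a minimal sketch, not the model's actual configuration: the index order (taken from the numbered list), the names `ID2LABEL` and `predict_dialect`, and the assumption that the model emits one logit per dialect are all hypothetical.

```python
import math

# Assumed label order: copied from the numbered list above. The real
# index order used by the trained model is not stated in this README.
ID2LABEL = {0: "Darangia", 1: "Kamrupia", 2: "Nalbaria", 3: "Upper Assam"}
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_dialect(logits):
    """Return (dialect name, probability) for one vector of four logits."""
    if len(logits) != len(ID2LABEL):
        raise ValueError("expected one logit per dialect")
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]
```

For example, `predict_dialect([0.1, 2.0, -1.0, 0.3])` selects index 1, which is "Kamrupia" under the assumed order.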

Training Dataset

The model was trained on a dataset of 300 speech samples, curated to include diverse speakers, phrases, and dialect features. The dataset includes:

  • Diverse Speakers: A range of accents, genders, and age groups.
  • Metadata: Information about speaker age, gender, district, and speech duration.
  • Common Phrases: Speech samples based on frequently used phrases in Assamese.
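
One way to work with the per-sample metadata described above is a small record type. This is only a sketch: the README names the metadata fields (age, gender, district, duration) but not their storage format, so the class `SampleMetadata`, its field names, the `dialect` field, and the example values are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class SampleMetadata:
    """Hypothetical record for one speech sample; the field names are assumed."""
    speaker_age: int
    speaker_gender: str
    district: str
    duration_seconds: float
    dialect: str  # assumed label field, one of the four dialects


def samples_per_dialect(records):
    """Count how many samples each dialect has."""
    return Counter(r.dialect for r in records)


def total_duration(records):
    """Sum the speech duration over all records, in seconds."""
    return sum(r.duration_seconds for r in records)


# Illustrative values only; these are not rows from the actual dataset.
records = [
    SampleMetadata(34, "female", "Nalbari", 3.0, "Nalbaria"),
    SampleMetadata(51, "male", "Kamrup", 4.5, "Kamrupia"),
]
```

A per-dialect count like this is a quick check on class balance before splitting the samples into training and evaluation sets.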