
Model Card for fastinom/ASR_fassy

Model Details

Model Description

This is the model card of a πŸ€— transformers automatic speech recognition (ASR) model for Shona (β‰ˆ965M parameters, F32 weights in Safetensors format) that has been pushed to the Hub.

  • Developed by: Fastino Mateteva
  • Model type: Transformer model
  • Language(s) (NLP): Shona
  • License: [More Information Needed]

How to Get Started with the Model

Use the code below to get started with the model.

Running the model
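
The snippet below is a minimal sketch of one way to run inference, assuming the checkpoint is loadable through the generic πŸ€— transformers ASR pipeline; the audio file path is a placeholder and the exact preprocessing may differ from the original training setup.

```python
from transformers import pipeline

# Minimal sketch: load the checkpoint through the generic ASR pipeline.
# "audio.wav" is a placeholder path to a mono recording of Shona speech.
asr = pipeline("automatic-speech-recognition", model="fastinom/ASR_fassy")

result = asr("audio.wav")
print(result["text"])
```

Provided ffmpeg is available, the pipeline handles decoding and resampling of common audio formats, so a .wav or .mp3 file can be passed directly.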

Training Details

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-4
  • per_device_train_batch_size: 4
  • eval_batch_size: 2
  • evaluation_strategy: steps
  • gradient_checkpointing: True
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 16
  • num_train_epochs: 3
  • save_total_limit: 1
  • fp16: True
  • save_steps: 400
  • eval_steps: 200
  • logging_steps: 200
  • push_to_hub: True
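
As an illustration, the values above can be expressed as a transformers.TrainingArguments object. This is a hedged sketch, not the original training script: output_dir is a placeholder, and argument names may vary slightly between transformers versions (for example, evaluation_strategy was later renamed eval_strategy).

```python
from transformers import TrainingArguments

# Sketch of the hyperparameters listed above as TrainingArguments.
# output_dir is a placeholder; the original training script is not published here.
training_args = TrainingArguments(
    output_dir="ASR_fassy",
    learning_rate=5e-4,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=2,
    evaluation_strategy="steps",
    gradient_checkpointing=True,
    gradient_accumulation_steps=4,  # batch 4 x accumulation 4 = effective batch size 16
    num_train_epochs=3,
    save_total_limit=1,
    fp16=True,
    save_steps=400,
    eval_steps=200,
    logging_steps=200,
    push_to_hub=True,
)
```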

Training results

Training Loss | WER  | Step | Validation Loss
6.427         | 1.00 |  200 | 4.1518
3.7979        | 1.00 |  400 | 3.8410
3.6924        | 1.00 |  600 | 3.4249
0.8357        | 0.26 |  800 | 0.2396
0.1528        | 0.24 | 1000 | 0.2155
0.1415        | 0.24 | 1200 | 0.2036
0.1278        | 0.24 | 1400 | 0.2028
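
The WER column is the word error rate on the evaluation set. A minimal sketch of how such a score can be computed with the Hugging Face evaluate library is shown below; the strings are illustrative only, not actual model outputs or reference transcriptions.

```python
import evaluate

# Illustrative WER computation; the example strings are made up.
wer_metric = evaluate.load("wer")
wer = wer_metric.compute(
    predictions=["mhoro nyika"],
    references=["mhoroi nyika"],
)
print(f"WER: {wer:.2f}")
```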

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: T4 GPU
  • Hours used: 3
  • Cloud Provider: Google Colab
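
The figures above are the inputs to the web-based ML Impact calculator. As an alternative, emissions can be logged programmatically with the third-party codecarbon package; this is only an illustration and was not part of the original training run.

```python
from codecarbon import EmissionsTracker

# Illustrative only: codecarbon was not used for the original training run.
tracker = EmissionsTracker()
tracker.start()
# ... run training here ...
emissions_kg = tracker.stop()  # estimated emissions in kg CO2-eq
print(f"Estimated emissions: {emissions_kg:.4f} kg CO2-eq")
```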

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Model Card Authors [optional]

Fastino Mateteva

Model Card Contact

fastinomateteva@gmail.com
