File size: 2,308 Bytes
1820a76
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
0e567e8
1820a76
0cbca66
c532c5d
 
 
 
1820a76
3a28a64
1820a76
 
 
80fe41f
6563141
4d71c2d
1820a76
 
 
 
e1d8c4c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f4c509a
 
 
 
 
 
 
1820a76
 
 
 
 
 
 
 
 
 
f4c509a
 
 
 
1820a76
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f4c509a
1820a76
 
 
f4c509a
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
---
library_name: transformers
tags: []
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->



## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->

This is the model card of a 🤗 transformers model that has been pushed on the Hub. 

- **Developed by:** [Fastino Mateteva]
- **Model type:** [Transformer model]
- **Language(s) (NLP):** [Shona]
- **License:** []


### How to Get Started with the Model

Use the code below to get started with the model.

  
## Running the model 

## Training Details



### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-4
- per_device_train_batch_size=4
- eval_batch_size: 2
- evaluation_strategy="steps"
- gradient_checkpointing=True
- gradient_accumulation_steps: 4
- total_train_batch_size: 16
- num_train_epochs=3
- save_total_limit=1
- fp16=True
- save_steps=400
- eval_steps=200
- logging_steps=200
- push_to_hub=True

### Training results

| Training Loss | WER   | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 6.427         | 1.00  | 200   | 4.1518          |
| 3.7979        | 1.00  | 400   | 3.8410          |
| 3.6924        | 1.00  | 600   | 3.4249          |
| 0.8357        | 0.26  | 800   | 0.2396          |
| 0.1528        | 0.24  | 1000  | 0.2155          |
| 0.1415        | 0.24  | 1200  | 0.2036          |
| 0.1278        | 0.24  | 1400  | 0.2028          |




## Environmental Impact

<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->

Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).

- **Hardware Type:** [T4 GPU]
- **Hours used:** [3]
- **Cloud Provider:** [Google Colab]


## Technical Specifications [optional]

### Model Architecture and Objective

[More Information Needed]

### Compute Infrastructure

[More Information Needed]

#### Hardware

[More Information Needed]

#### Software

[More Information Needed]


## Model Card Authors [optional]

[Fastino Mateteva]

## Model Card Contact

[fastinomateteva@gmail.com]