Update README.md
README.md
CHANGED
@@ -4,27 +4,32 @@ library_name: transformers
 language:
 - en
 - pt
 ---
-#
 
-
 
-The model
 
-
-- Size: 23,805,216 parameters
-- Dataset:
-- Languages: English, Portuguese
-- Number of Training
-- Batch size: 32
-- Optimizer: rmsprop
-- Learning Rate: 0.001
-- Hardware: TeslaT4
-- This repository has the source [code used](https://github.com/Nkluge-correa/teeny-tiny_castle/blob/master/ML%20Intro%20Course/16_sequence_to_sequence.ipynb) to train this model.
 
-
 
-
 import tensorflow as tf
 import numpy as np
 import string
@@ -138,16 +143,41 @@ Portuguese translation:
 [start] não faça isso [end]
 --------------------------------------------------
 ```
 # Cite as 🤗
-```
 @misc{teenytinycastle,
 doi = {10.5281/zenodo.7112065},
-url = {https://
 author = {Nicholas Kluge Corr{\^e}a},
 title = {Teeny-Tiny Castle},
-year = {
-publisher = {
-journal = {
 }
 ```
 ## License
@@ -4,27 +4,32 @@ library_name: transformers
 language:
 - en
 - pt
+pipeline_tag: translation
 ---
+# Transformer-eng-por
 
+## Model Overview
 
+The transformer-eng-por model is a transformer trained for English-to-Portuguese translation.
 
+### Details
 
+- **Size:** 23,805,216 parameters
+- **Model type:** Transformer
+- **Optimizer:** `rmsprop`
+- **Number of Epochs:** 30
+- **Embedding dimensionality:** 256
+- **Dense dimensionality:** 2048
+- **Attention heads:** 8
+- **Vocabulary size:** 20000
+- **Sequence length:** 20
+- **Hardware:** Tesla T4
+- **Emissions:** Not measured
+- **Total Energy Consumption:** Not measured
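
The hyperparameters listed under Details can be gathered into a single configuration object. The sketch below is illustrative only: the class name `TransformerConfig` and its field names are hypothetical and do not come from the training notebook, and the embedding-table arithmetic assumes a standard token-embedding layer with one vector per vocabulary entry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransformerConfig:
    """Hyperparameters copied from the Details list.

    The class and field names are hypothetical; they are not taken
    from the training notebook.
    """
    embed_dim: int = 256        # embedding dimensionality
    dense_dim: int = 2048       # width of the dense (feed-forward) layer
    num_heads: int = 8          # attention heads
    vocab_size: int = 20000     # vocabulary size
    sequence_length: int = 20   # maximum tokens per sentence
    epochs: int = 30
    optimizer: str = "rmsprop"


config = TransformerConfig()

# Assumption: one embed_dim-sized vector per vocabulary entry.
token_embedding_weights = config.vocab_size * config.embed_dim
print(token_embedding_weights)  # 5120000
```

Under that assumption, a single token-embedding table accounts for 5,120,000 weights; the remaining parameters would sit in the other layers of the model.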
 
+### How to Use
+
+```python
 import tensorflow as tf
 import numpy as np
 import string
@@ -138,16 +143,41 @@ Portuguese translation:
 [start] não faça isso [end]
 --------------------------------------------------
 ```
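
The sample output above wraps each translation in `[start]` and `[end]` markers. If only the translated text is needed, the markers can be removed afterwards. This is a minimal sketch: `strip_markers` is a hypothetical helper, not part of the model's code, and it assumes the markers appear as whitespace-separated tokens exactly as shown in the output.

```python
def strip_markers(translation: str, start: str = "[start]", end: str = "[end]") -> str:
    """Return the text between the start and end markers.

    Tokens after the first end marker are discarded; if no end marker
    is present, everything after the start marker is kept.
    """
    tokens = translation.split()
    if tokens and tokens[0] == start:
        tokens = tokens[1:]          # drop the leading start marker
    if end in tokens:
        tokens = tokens[: tokens.index(end)]  # cut at the first end marker
    return " ".join(tokens)


print(strip_markers("[start] não faça isso [end]"))  # não faça isso
```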
+## Intended Use
+
+This model was created for research purposes only. Specifically, it was designed to translate sentences from English to Portuguese. We do not recommend any application of this model outside this scope.
+
+
+## Performance Metrics
+
+Accuracy: 76.46%
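
The README does not state how the accuracy figure was computed. One common choice for sequence-to-sequence models is token-level accuracy that ignores padding positions. The sketch below assumes that definition, a hypothetical padding id of `0`, and made-up token ids; it is not the notebook's evaluation code.

```python
def masked_token_accuracy(targets, predictions, pad_id=0):
    """Fraction of non-padding target tokens that were predicted correctly."""
    correct = 0
    total = 0
    for target_seq, predicted_seq in zip(targets, predictions):
        for target, predicted in zip(target_seq, predicted_seq):
            if target == pad_id:
                continue  # padding positions do not count
            total += 1
            correct += int(target == predicted)
    return correct / total if total else 0.0


# Toy token ids, invented for illustration.
targets = [[5, 8, 3, 0, 0], [7, 2, 0, 0, 0]]
predictions = [[5, 8, 4, 0, 1], [7, 2, 9, 0, 0]]
print(masked_token_accuracy(targets, predictions))  # 0.8
```

Here four of the five non-padding target tokens match, so the function returns 0.8; predictions at padded positions are ignored.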
+
+
+## Training Data
+
+[English-portuguese translation](https://www.kaggle.com/datasets/nageshsingh/englishportuguese-translation).
+
+The dataset consists of a set of English and Portuguese sentences.
+
+## Limitations
+
+In translating `What is its name?` to `[start] o que é o seu nome [end]`, the `transformer` model makes a gender assumption,
+even though the source sentence is not gendered (`[start] qual é o nome dele [end]`).
+Errors like these are not uncommon in NLP; algorithmic bias is one of the major problems associated with using language models
+in real applications. We therefore do not recommend using this model in real-world applications.
+It was developed solely for academic and educational purposes.
+
 # Cite as 🤗
+```latex
 @misc{teenytinycastle,
 doi = {10.5281/zenodo.7112065},
+url = {https://github.com/Nkluge-correa/teeny-tiny_castle},
 author = {Nicholas Kluge Corr{\^e}a},
 title = {Teeny-Tiny Castle},
+year = {2024},
+publisher = {GitHub},
+journal = {GitHub repository},
+note = {Last updated 12 January 2024},
 }
 ```
 ## License