crscardellino commited on
Commit
a15aee5
1 Parent(s): 36cdbc8

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -17
README.md CHANGED
@@ -1,25 +1,65 @@
1
- Hugging Face: IA Colaborativa
2
- =============================
 
 
 
 
 
 
3
 
4
- En este repositorio estará disponible el código y modelo que entrené para la
5
- charla ["Hugging Face: IA Colaborativa"](https://eventol.flisol.org.ar/events/cordoba2023/activity/378/)
6
- del [FLISoL de Córdoba](https://cordoba.flisol.org.ar), Argentina, de 2023.
7
 
8
- Para inicializar el setup hace falta tener instalado y activado
9
- [`git-lfs`](https://git-lfs.com/).
10
 
11
- Pueden clonar el repositorio con:
 
 
12
 
13
- $ git clone https://huggingface.co/crscardellino/flisol-cba-martin-fierro
14
 
15
- Y luego crean el entorno e instalan los requerimientos.
16
 
17
- $ python -m venv flisol-venv
18
- $ source ./flisol-venv/bin/activate
19
- (flisol-venv) $ pip install -r requirements.txt
20
 
21
- El código está probado con Python 3.10, pero debería funcionar con Python >=
22
- 3.8. En los requerimientos está organizado para instalar
23
- [PyTorch](https://pytorch.org/) v2.0.0 para cpu, pero pueden ajustarlo para
24
- utilizar GPUs suponiendo que cumplan los requerimientos de CUDA.
25
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - generated_from_trainer
5
+ model-index:
6
+ - name: flisol-cba-martin-fierro
7
+ results: []
8
+ ---
9
 
10
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
+ should probably proofread and complete it, then remove this comment. -->
 
12
 
13
+ # flisol-cba-martin-fierro
 
14
 
15
+ This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on the None dataset.
16
+ It achieves the following results on the evaluation set:
17
+ - Loss: 3.9067
18
 
19
+ ## Model description
20
 
21
+ More information needed
22
 
23
+ ## Intended uses & limitations
 
 
24
 
25
+ More information needed
 
 
 
26
 
27
+ ## Training and evaluation data
28
+
29
+ More information needed
30
+
31
+ ## Training procedure
32
+
33
+ ### Training hyperparameters
34
+
35
+ The following hyperparameters were used during training:
36
+ - learning_rate: 2e-05
37
+ - train_batch_size: 8
38
+ - eval_batch_size: 8
39
+ - seed: 42
40
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
+ - lr_scheduler_type: linear
42
+ - num_epochs: 10
43
+
44
+ ### Training results
45
+
46
+ | Training Loss | Epoch | Step | Validation Loss |
47
+ |:-------------:|:-----:|:----:|:---------------:|
48
+ | 4.3864 | 1.0 | 18 | 4.2025 |
49
+ | 3.948 | 2.0 | 36 | 4.0440 |
50
+ | 3.7962 | 3.0 | 54 | 3.9804 |
51
+ | 3.6105 | 4.0 | 72 | 3.9458 |
52
+ | 3.4444 | 5.0 | 90 | 3.9280 |
53
+ | 3.3855 | 6.0 | 108 | 3.9192 |
54
+ | 3.3142 | 7.0 | 126 | 3.9091 |
55
+ | 3.2192 | 8.0 | 144 | 3.9074 |
56
+ | 3.1615 | 9.0 | 162 | 3.9070 |
57
+ | 3.1637 | 10.0 | 180 | 3.9067 |
58
+
59
+
60
+ ### Framework versions
61
+
62
+ - Transformers 4.28.1
63
+ - Pytorch 2.0.0+cpu
64
+ - Datasets 2.11.0
65
+ - Tokenizers 0.13.3