Update README.md
Browse files
README.md
CHANGED
@@ -18,36 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
|
20 |
Los siguientes son los resultados sobre el conjunto de evaluaci贸n:
|
21 |
-
|
22 |
-
'eval_precision': 0.8704318936877077,
|
23 |
-
'eval_recall': 0.8833942118572633,
|
24 |
-
'eval_f1': 0.8768651513038628,
|
25 |
-
'eval_accuracy': 0.982701988941157,
|
26 |
|
27 |
-
|
28 |
-
'recall': 0.8238007380073801,
|
29 |
-
'f1': 0.8541367766618843,
|
30 |
-
'number': 1084},
|
31 |
-
|
32 |
-
'eval_MISC': {'precision': 0.7349726775956285,
|
33 |
-
'recall': 0.7911764705882353,
|
34 |
-
'f1': 0.7620396600566574,
|
35 |
-
'number': 340},
|
36 |
-
|
37 |
-
'eval_ORG': {'precision': 0.8400272294077604,
|
38 |
-
'recall': 0.8814285714285715,
|
39 |
-
'f1': 0.8602300453119553,
|
40 |
-
'number': 1400},
|
41 |
-
|
42 |
-
'eval_PER': {'precision': 0.9599465954606141,
|
43 |
-
'recall': 0.9782312925170068,
|
44 |
-
'f1': 0.9690026954177898,
|
45 |
-
'number': 735},
|
46 |
-
|
47 |
-
'eval_runtime': 3.6357,
|
48 |
-
'eval_samples_per_second': 417.526,
|
49 |
-
'eval_steps_per_second': 26.13,
|
50 |
-
'epoch': 5.0}
|
51 |
|
52 |
## Model description
|
53 |
|
@@ -57,10 +34,40 @@ con el fin de almacenar de manera temoporal el modelo que pesa alrededor de 35 G
|
|
57 |
|
58 |
|
59 |
The following hyperparameters were used during training:
|
60 |
-
|
61 |
-
|
62 |
-
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
|
19 |
|
20 |
Los siguientes son los resultados sobre el conjunto de evaluaci贸n:
|
21 |
+
- 'eval_loss': 0.0929097980260849,
|
22 |
+
- 'eval_precision': 0.8704318936877077,
|
23 |
+
- 'eval_recall': 0.8833942118572633,
|
24 |
+
- 'eval_f1': 0.8768651513038628,
|
25 |
+
- 'eval_accuracy': 0.982701988941157,
|
26 |
|
27 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
34 |
|
35 |
|
36 |
The following hyperparameters were used during training:
|
37 |
+
eval_strategy="epoch",
|
38 |
+
save_strategy="epoch",
|
39 |
+
learning_rate=2e-5, # (Aprendizaje se esta cambiando)
|
40 |
+
per_device_train_batch_size=16,
|
41 |
+
per_device_eval_batch_size=16,
|
42 |
+
num_train_epochs=5,
|
43 |
+
weight_decay=0.1,
|
44 |
+
max_grad_norm=1.0,
|
45 |
+
adam_epsilon=1e-5,
|
46 |
+
fp16=True,
|
47 |
+
save_total_limit=2,
|
48 |
+
load_best_model_at_end=True,
|
49 |
+
push_to_hub=True,
|
50 |
+
metric_for_best_model="f1",
|
51 |
+
seed=42,
|
52 |
+
|
53 |
+
|
54 |
+
|
55 |
+
| Metric | Value |
|
56 |
+
|-----------------|-------------|
|
57 |
+
| eval_loss | 0.12918254733085632 |
|
58 |
+
| eval_precision | 0.8674463937621832 |
|
59 |
+
| eval_recall | 0.8752458555774094 |
|
60 |
+
| eval_f1 | 0.8713286713286713 |
|
61 |
+
| eval_accuracy | 0.9813980358174466 |
|
62 |
+
| eval_runtime | 3.6357 |
|
63 |
+
| eval_samples_per_second | 417.526 |
|
64 |
+
| eval_steps_per_second | 26.13 |
|
65 |
+
| epoch | 5.0 |
|
66 |
+
|
67 |
+
| Label | Precision | Recall | F1 | Number |
|
68 |
+
|--------|-----------|--------|------------|--------|
|
69 |
+
| LOC | 0.8867924528301887 | 0.8238007380073801 | 0.8541367766618843 | 1084 |
|
70 |
+
| MISC | 0.7349726775956285 | 0.7911764705882353 | 0.7620396600566574 | 340 |
|
71 |
+
| ORG | 0.8400272294077604 | 0.8814285714285715 | 0.8602300453119553 | 1400 |
|
72 |
+
| PER | 0.9599465954606141 | 0.9782312925170068 | 0.9690026954177898 | 735 |
|
73 |
+
|