File size: 3,195 Bytes
1493f5d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
109a5c5
 
 
1493f5d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
99ca729
460e26d
5c62944
c48077e
ac88997
a342149
c23594c
d943c1b
e5d5bb0
a856023
40caa44
53a8097
c75a2f4
09a8371
764eeeb
8e36110
3158432
7c9a842
ed590f0
83f1dd1
a2b9ada
0d6d955
bf8f725
811924d
a3c9a66
7939f49
75ae87e
6b4b52f
b99b238
247abf9
93cfff6
1a1a612
cce713c
77888cf
c789768
0759589
e317b14
73770a6
a6cceaf
07fa5d6
fd23f20
1f24e74
ba2d3bc
128e448
418e099
109a5c5
1493f5d
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
---
license: apache-2.0
base_model: t5-base
tags:
- generated_from_keras_callback
model-index:
- name: JuliusFx/dyu-fr-t5-base_v1
  results: []
---

<!-- This model card has been generated automatically according to the information Keras had access to. You should
probably proofread and complete it, then remove this comment. -->

# JuliusFx/dyu-fr-t5-base_v1

This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on an unknown dataset.
It achieves the following results on the evaluation set:
- Train Loss: 1.3233
- Validation Loss: 3.0376
- Epoch: 46

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32

### Training results

| Train Loss | Validation Loss | Epoch |
|:----------:|:---------------:|:-----:|
| 3.2963     | 3.1402          | 0     |
| 3.0454     | 3.0436          | 1     |
| 2.9282     | 3.0219          | 2     |
| 2.8338     | 2.9804          | 3     |
| 2.7463     | 2.9778          | 4     |
| 2.6807     | 2.9307          | 5     |
| 2.6156     | 2.9288          | 6     |
| 2.5501     | 2.9221          | 7     |
| 2.4933     | 2.9245          | 8     |
| 2.4400     | 2.9083          | 9     |
| 2.3910     | 2.9285          | 10    |
| 2.3451     | 2.9178          | 11    |
| 2.2967     | 2.9217          | 12    |
| 2.2496     | 2.9160          | 13    |
| 2.2099     | 2.9176          | 14    |
| 2.1726     | 2.8832          | 15    |
| 2.1317     | 2.9009          | 16    |
| 2.0931     | 2.8764          | 17    |
| 2.0541     | 2.8484          | 18    |
| 2.0237     | 2.8875          | 19    |
| 1.9935     | 2.8943          | 20    |
| 1.9538     | 2.8810          | 21    |
| 1.9218     | 2.8885          | 22    |
| 1.8905     | 2.8650          | 23    |
| 1.8631     | 2.8671          | 24    |
| 1.8290     | 2.8832          | 25    |
| 1.8046     | 2.8879          | 26    |
| 1.7761     | 2.9429          | 27    |
| 1.7414     | 2.9406          | 28    |
| 1.7167     | 2.9296          | 29    |
| 1.6926     | 2.9174          | 30    |
| 1.6639     | 2.9762          | 31    |
| 1.6421     | 2.9700          | 32    |
| 1.6102     | 2.9565          | 33    |
| 1.5877     | 2.9810          | 34    |
| 1.5658     | 2.9643          | 35    |
| 1.5390     | 3.0225          | 36    |
| 1.5152     | 3.0029          | 37    |
| 1.4990     | 2.9756          | 38    |
| 1.4748     | 3.0228          | 39    |
| 1.4483     | 3.0092          | 40    |
| 1.4286     | 3.0356          | 41    |
| 1.4051     | 3.0226          | 42    |
| 1.3841     | 3.0442          | 43    |
| 1.3577     | 3.0595          | 44    |
| 1.3433     | 3.0547          | 45    |
| 1.3233     | 3.0376          | 46    |


### Framework versions

- Transformers 4.38.2
- TensorFlow 2.15.0
- Datasets 2.18.0
- Tokenizers 0.15.2