mcamara commited on
Commit
538aa2e
1 Parent(s): 023c735

mcamara/gemma-2b-es-spanishbillionwords

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 4.0578
22
 
23
  ## Model description
24
 
@@ -37,7 +37,7 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 0.0002
41
  - train_batch_size: 1
42
  - eval_batch_size: 8
43
  - seed: 42
@@ -54,15 +54,15 @@ The following hyperparameters were used during training:
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
  | 1.2899 | 1.0 | 1 | 4.1511 |
57
- | 1.2899 | 2.0 | 2 | 4.1451 |
58
- | 1.2466 | 3.0 | 3 | 4.1311 |
59
- | 1.1503 | 4.0 | 4 | 4.1179 |
60
- | 1.0691 | 5.0 | 5 | 4.1011 |
61
- | 0.9985 | 6.0 | 6 | 4.0873 |
62
- | 0.9366 | 7.0 | 7 | 4.0770 |
63
- | 0.8845 | 8.0 | 8 | 4.0662 |
64
- | 0.8436 | 9.0 | 9 | 4.0618 |
65
- | 0.8154 | 10.0 | 10 | 4.0578 |
66
 
67
 
68
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 4.1108
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 0.0001
41
  - train_batch_size: 1
42
  - eval_batch_size: 8
43
  - seed: 42
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
  | 1.2899 | 1.0 | 1 | 4.1511 |
57
+ | 1.2899 | 2.0 | 2 | 4.1486 |
58
+ | 1.269 | 3.0 | 3 | 4.1424 |
59
+ | 1.2206 | 4.0 | 4 | 4.1363 |
60
+ | 1.1768 | 5.0 | 5 | 4.1303 |
61
+ | 1.1391 | 6.0 | 6 | 4.1232 |
62
+ | 1.1083 | 7.0 | 7 | 4.1190 |
63
+ | 1.0829 | 8.0 | 8 | 4.1162 |
64
+ | 1.0633 | 9.0 | 9 | 4.1131 |
65
+ | 1.05 | 10.0 | 10 | 4.1108 |
66
 
67
 
68
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5e7e2943a45bdf8880eec94a90fabdc0ef73d8bc7d79dff7a8a5cfd91cf6da93
3
  size 39256456
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9ab5e319b196c0a9e19d54444d3dbcdb021ed662550ce77e83744a1efff6fae1
3
  size 39256456
runs/Mar11_14-04-49_byo-WS5/events.out.tfevents.1710162290.byo-WS5.256887.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b2b22fb6de8e825a623b0fe36905fe2d56635f89aaf8dc384d9c172068cfb26c
3
+ size 10127
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:edb94c619b340b0d80e7140ca60e5886398961d1a696d0a3115ab0b138bf8bdc
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ffd8063a4009b25c6ac0b77f6bd5247365eaa588138918216b9109515de9911
3
  size 4920