mcamara/gemma-2b-es-spanishbillionwords

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.0578
 ## Model description
@@ -37,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 1
 - eval_batch_size: 8
 - seed: 42
@@ -54,15 +54,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 1.2899        | 1.0   | 1    | 4.1511          |
-| 1.2899        | 2.0   | 2    | 4.1451          |
-| 1.2466        | 3.0   | 3    | 4.1311          |
-| 1.1503        | 4.0   | 4    | 4.1179          |
-| 1.0691        | 5.0   | 5    | 4.1011          |
-| 0.9985        | 6.0   | 6    | 4.0873          |
-| 0.9366        | 7.0   | 7    | 4.0770          |
-| 0.8845        | 8.0   | 8    | 4.0662          |
-| 0.8436        | 9.0   | 9    | 4.0618          |
-| 0.8154        | 10.0  | 10   | 4.0578          |
 ### Framework versions

 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.1108
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 1
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 1.2899        | 1.0   | 1    | 4.1511          |
+| 1.2899        | 2.0   | 2    | 4.1486          |
+| 1.269         | 3.0   | 3    | 4.1424          |
+| 1.2206        | 4.0   | 4    | 4.1363          |
+| 1.1768        | 5.0   | 5    | 4.1303          |
+| 1.1391        | 6.0   | 6    | 4.1232          |
+| 1.1083        | 7.0   | 7    | 4.1190          |
+| 1.0829        | 8.0   | 8    | 4.1162          |
+| 1.0633        | 9.0   | 9    | 4.1131          |
+| 1.05          | 10.0  | 10   | 4.1108          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5e7e2943a45bdf8880eec94a90fabdc0ef73d8bc7d79dff7a8a5cfd91cf6da93
 size 39256456

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ab5e319b196c0a9e19d54444d3dbcdb021ed662550ce77e83744a1efff6fae1
 size 39256456

runs/Mar11_14-04-49_byo-WS5/events.out.tfevents.1710162290.byo-WS5.256887.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b2b22fb6de8e825a623b0fe36905fe2d56635f89aaf8dc384d9c172068cfb26c
+size 10127

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:edb94c619b340b0d80e7140ca60e5886398961d1a696d0a3115ab0b138bf8bdc
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:1ffd8063a4009b25c6ac0b77f6bd5247365eaa588138918216b9109515de9911
 size 4920