madatnlp
/

ke-t5-scratch

@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.7822
-- Validation Loss: 0.7830
-- Epoch: 78
 ## Model description
@@ -41,85 +41,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 8.1521     | 4.7300          | 0     |
-| 4.5192     | 3.2568          | 1     |
-| 3.4913     | 2.7407          | 2     |
-| 2.9793     | 2.3012          | 3     |
-| 2.6884     | 2.2078          | 4     |
-| 2.4174     | 1.9750          | 5     |
-| 2.2986     | 1.8614          | 6     |
-| 2.1074     | 1.5899          | 7     |
-| 2.0098     | 1.6175          | 8     |
-| 1.9521     | 1.6886          | 9     |
-| 1.8728     | 1.4932          | 10    |
-| 1.7885     | 1.4924          | 11    |
-| 1.6926     | 1.5297          | 12    |
-| 1.6965     | 1.3357          | 13    |
-| 1.6310     | 1.4610          | 14    |
-| 1.6144     | 1.3840          | 15    |
-| 1.5745     | 1.3763          | 16    |
-| 1.5375     | 1.2571          | 17    |
-| 1.5061     | 1.2785          | 18    |
-| 1.4627     | 1.2533          | 19    |
-| 1.4811     | 1.2842          | 20    |
-| 1.4139     | 1.1947          | 21    |
-| 1.3810     | 1.1776          | 22    |
-| 1.3875     | 1.1778          | 23    |
-| 1.3648     | 1.1158          | 24    |
-| 1.3542     | 1.0514          | 25    |
-| 1.3055     | 1.1236          | 26    |
-| 1.3064     | 1.0815          | 27    |
-| 1.2706     | 1.0877          | 28    |
-| 1.2624     | 1.0519          | 29    |
-| 1.2626     | 1.0742          | 30    |
-| 1.2480     | 1.0478          | 31    |
-| 1.2137     | 1.0730          | 32    |
-| 1.2224     | 1.0391          | 33    |
-| 1.1865     | 0.9326          | 34    |
-| 1.1898     | 0.9682          | 35    |
-| 1.1624     | 1.0019          | 36    |
-| 1.1561     | 1.0398          | 37    |
-| 1.1376     | 0.9905          | 38    |
-| 1.1346     | 0.9957          | 39    |
-| 1.1269     | 1.0249          | 40    |
-| 1.1009     | 0.9031          | 41    |
-| 1.0933     | 0.9404          | 42    |
-| 1.0637     | 0.9948          | 43    |
-| 1.0598     | 0.9522          | 44    |
-| 1.0699     | 0.8862          | 45    |
-| 1.0712     | 0.8994          | 46    |
-| 1.0542     | 0.9584          | 47    |
-| 1.0199     | 0.9133          | 48    |
-| 1.0248     | 0.9043          | 49    |
-| 1.0052     | 0.8633          | 50    |
-| 0.9896     | 0.9102          | 51    |
-| 1.0057     | 0.8636          | 52    |
-| 0.9817     | 0.8472          | 53    |
-| 0.9668     | 0.8686          | 54    |
-| 0.9541     | 0.8884          | 55    |
-| 0.9731     | 0.8862          | 56    |
-| 0.9364     | 0.8166          | 57    |
-| 0.9429     | 0.8448          | 58    |
-| 0.9285     | 0.8711          | 59    |
-| 0.9222     | 0.8232          | 60    |
-| 0.9163     | 0.8039          | 61    |
-| 0.8940     | 0.8100          | 62    |
-| 0.8967     | 0.8318          | 63    |
-| 0.8892     | 0.7802          | 64    |
-| 0.8869     | 0.7814          | 65    |
-| 0.8683     | 0.7917          | 66    |
-| 0.8623     | 0.8259          | 67    |
-| 0.8480     | 0.7539          | 68    |
-| 0.8441     | 0.8287          | 69    |
-| 0.8312     | 0.7945          | 70    |
-| 0.8215     | 0.8013          | 71    |
-| 0.8289     | 0.7837          | 72    |
-| 0.8181     | 0.7744          | 73    |
-| 0.8062     | 0.7707          | 74    |
-| 0.8025     | 0.7796          | 75    |
-| 0.7848     | 0.7748          | 76    |
-| 0.7860     | 0.7861          | 77    |
-| 0.7822     | 0.7830          | 78    |
 ### Framework versions

 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 4.2751
+- Validation Loss: 2.1074
+- Epoch: 0
 ## Model description
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 4.2751     | 2.1074          | 0     |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e8bb7df150dcf3a0b1db2be7c9ccbf65a02e5c4961179dc5a2aa826d9c0b02be
 size 831509840

 version https://git-lfs.github.com/spec/v1
+oid sha256:402117b1fe2824d706fc9ec5d07405b0b702793247f5459a03dfdd8f55bbd52d
 size 831509840

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff