End of training

Browse files

Files changed (4) hide show

README.md +66 -44
model.safetensors +1 -1
runs/Nov10_17-33-18_christopher-System-Product-Name/events.out.tfevents.1699597998.christopher-System-Product-Name.30388.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1133
 ## Model description
@@ -47,49 +47,71 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.6316        | 0.46  | 50   | 0.6287          |
-| 0.6403        | 0.93  | 100  | 0.3378          |
-| 0.4213        | 1.39  | 150  | 0.2460          |
-| 0.3452        | 1.85  | 200  | 0.2184          |
-| 0.306         | 2.31  | 250  | 0.1903          |
-| 0.2634        | 2.78  | 300  | 0.1807          |
-| 0.2423        | 3.24  | 350  | 0.1630          |
-| 0.2224        | 3.7   | 400  | 0.1599          |
-| 0.2107        | 4.17  | 450  | 0.1522          |
-| 0.1922        | 4.63  | 500  | 0.1515          |
-| 0.1887        | 5.09  | 550  | 0.1394          |
-| 0.1821        | 5.56  | 600  | 0.1414          |
-| 0.1705        | 6.02  | 650  | 0.1378          |
-| 0.1602        | 6.48  | 700  | 0.1330          |
-| 0.1579        | 6.94  | 750  | 0.1300          |
-| 0.1497        | 7.41  | 800  | 0.1282          |
-| 0.1534        | 7.87  | 850  | 0.1277          |
-| 0.147         | 8.33  | 900  | 0.1274          |
-| 0.1395        | 8.8   | 950  | 0.1204          |
-| 0.1361        | 9.26  | 1000 | 0.1235          |
-| 0.1353        | 9.72  | 1050 | 0.1210          |
-| 0.1303        | 10.19 | 1100 | 0.1220          |
-| 0.132         | 10.65 | 1150 | 0.1232          |
-| 0.1262        | 11.11 | 1200 | 0.1193          |
-| 0.1228        | 11.57 | 1250 | 0.1229          |
-| 0.1261        | 12.04 | 1300 | 0.1215          |
-| 0.1204        | 12.5  | 1350 | 0.1163          |
-| 0.12          | 12.96 | 1400 | 0.1189          |
-| 0.11          | 13.43 | 1450 | 0.1173          |
-| 0.1183        | 13.89 | 1500 | 0.1149          |
-| 0.108         | 14.35 | 1550 | 0.1178          |
-| 0.1122        | 14.81 | 1600 | 0.1150          |
-| 0.1126        | 15.28 | 1650 | 0.1157          |
-| 0.112         | 15.74 | 1700 | 0.1152          |
-| 0.1046        | 16.2  | 1750 | 0.1156          |
-| 0.1057        | 16.67 | 1800 | 0.1138          |
-| 0.1067        | 17.13 | 1850 | 0.1129          |
-| 0.1078        | 17.59 | 1900 | 0.1140          |
-| 0.1043        | 18.06 | 1950 | 0.1135          |
-| 0.1033        | 18.52 | 2000 | 0.1138          |
-| 0.1017        | 18.98 | 2050 | 0.1140          |
-| 0.102         | 19.44 | 2100 | 0.1125          |
-| 0.1012        | 19.91 | 2150 | 0.1133          |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1027
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.7961        | 0.3   | 50   | 0.7538          |
+| 0.7207        | 0.61  | 100  | 0.3876          |
+| 0.4596        | 0.91  | 150  | 0.2763          |
+| 0.3536        | 1.22  | 200  | 0.2265          |
+| 0.3089        | 1.52  | 250  | 0.1937          |
+| 0.2736        | 1.83  | 300  | 0.1842          |
+| 0.2415        | 2.13  | 350  | 0.1713          |
+| 0.2309        | 2.44  | 400  | 0.1601          |
+| 0.2011        | 2.74  | 450  | 0.1533          |
+| 0.198         | 3.05  | 500  | 0.1464          |
+| 0.1816        | 3.35  | 550  | 0.1418          |
+| 0.1887        | 3.66  | 600  | 0.1354          |
+| 0.1717        | 3.96  | 650  | 0.1295          |
+| 0.1589        | 4.27  | 700  | 0.1320          |
+| 0.1606        | 4.57  | 750  | 0.1230          |
+| 0.1545        | 4.88  | 800  | 0.1255          |
+| 0.1502        | 5.18  | 850  | 0.1247          |
+| 0.1438        | 5.49  | 900  | 0.1251          |
+| 0.1395        | 5.79  | 950  | 0.1222          |
+| 0.1414        | 6.1   | 1000 | 0.1173          |
+| 0.133         | 6.4   | 1050 | 0.1149          |
+| 0.1338        | 6.71  | 1100 | 0.1124          |
+| 0.1361        | 7.01  | 1150 | 0.1148          |
+| 0.1269        | 7.32  | 1200 | 0.1137          |
+| 0.123         | 7.62  | 1250 | 0.1145          |
+| 0.1203        | 7.93  | 1300 | 0.1129          |
+| 0.1194        | 8.23  | 1350 | 0.1081          |
+| 0.1177        | 8.54  | 1400 | 0.1099          |
+| 0.1173        | 8.84  | 1450 | 0.1109          |
+| 0.113         | 9.15  | 1500 | 0.1107          |
+| 0.1122        | 9.45  | 1550 | 0.1068          |
+| 0.11          | 9.76  | 1600 | 0.1072          |
+| 0.1078        | 10.06 | 1650 | 0.1086          |
+| 0.101         | 10.37 | 1700 | 0.1088          |
+| 0.1106        | 10.67 | 1750 | 0.1079          |
+| 0.1094        | 10.98 | 1800 | 0.1109          |
+| 0.1072        | 11.28 | 1850 | 0.1054          |
+| 0.103         | 11.59 | 1900 | 0.1062          |
+| 0.1009        | 11.89 | 1950 | 0.1051          |
+| 0.1005        | 12.2  | 2000 | 0.1049          |
+| 0.0985        | 12.5  | 2050 | 0.1059          |
+| 0.0983        | 12.8  | 2100 | 0.1063          |
+| 0.0953        | 13.11 | 2150 | 0.1062          |
+| 0.0935        | 13.41 | 2200 | 0.1044          |
+| 0.1003        | 13.72 | 2250 | 0.1034          |
+| 0.0935        | 14.02 | 2300 | 0.1049          |
+| 0.0935        | 14.33 | 2350 | 0.1038          |
+| 0.096         | 14.63 | 2400 | 0.1020          |
+| 0.0894        | 14.94 | 2450 | 0.1048          |
+| 0.0931        | 15.24 | 2500 | 0.1034          |
+| 0.0888        | 15.55 | 2550 | 0.1030          |
+| 0.0904        | 15.85 | 2600 | 0.1038          |
+| 0.0885        | 16.16 | 2650 | 0.1046          |
+| 0.088         | 16.46 | 2700 | 0.1041          |
+| 0.0925        | 16.77 | 2750 | 0.1027          |
+| 0.0835        | 17.07 | 2800 | 0.1034          |
+| 0.089         | 17.38 | 2850 | 0.1036          |
+| 0.0844        | 17.68 | 2900 | 0.1043          |
+| 0.0866        | 17.99 | 2950 | 0.1031          |
+| 0.0835        | 18.29 | 3000 | 0.1030          |
+| 0.0826        | 18.6  | 3050 | 0.1028          |
+| 0.0874        | 18.9  | 3100 | 0.1018          |
+| 0.0846        | 19.21 | 3150 | 0.1030          |
+| 0.0852        | 19.51 | 3200 | 0.1026          |
+| 0.0835        | 19.82 | 3250 | 0.1027          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:277450e6e2c32486280a66c0b9ab63250650b85cca9d9746a145c0ecb4d001a0
 size 307867048

 version https://git-lfs.github.com/spec/v1
+oid sha256:e6fa312c0462dad2cef000dae1feadd95c8999644309948dfd9f2c794e141225
 size 307867048

runs/Nov10_17-33-18_christopher-System-Product-Name/events.out.tfevents.1699597998.christopher-System-Product-Name.30388.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b37193c65be21980ec38823076173b098f099925b49453e710323736c804dff
+size 33464

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8cce8e4d2c3994de90beaf7187c95c1c845d1b3b40d1f2bf070c0183d2da8ae7
 size 4792

 version https://git-lfs.github.com/spec/v1
+oid sha256:4b905a3df27a9d55cacafa59f3b6d59fcbcdb4816ce375f5a526c4e0a1f0cd09
 size 4792