The training code can be found on [Github](https://github.com/oeg-upm/software_m
## Evaluation Results

These are the hyperparameters used to train the model:

* evaluation_strategy = "epoch"
* save_strategy = "no"
* per_device_train_batch_size = 16
* per_device_eval_batch_size = 16
* num_train_epochs = 3
* weight_decay = 1e-5
* learning_rate = 1e-4
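These names correspond to keyword arguments of `transformers.TrainingArguments`. A minimal sketch of how the configuration above could be reproduced (the output directory and the model/dataset loading are placeholders, not stated in this card):

```python
# Hyperparameters from the model card, expressed as keyword arguments
# for transformers.TrainingArguments (sketch; output_dir is a placeholder).
training_kwargs = dict(
    evaluation_strategy="epoch",
    save_strategy="no",
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=3,
    weight_decay=1e-5,
    learning_rate=1e-4,
)

# With transformers installed, this dict can be unpacked directly:
# from transformers import TrainingArguments
# args = TrainingArguments(output_dir="out", **training_kwargs)
```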

The evaluation results are:

* Precision: 0.8928176795580111
* Recall: 0.8568398727465536
* F1-score: 0.8744588744588745
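As a quick sanity check, the reported F1-score is the harmonic mean of the reported precision and recall:

```python
# Sanity check: F1 is the harmonic mean of precision and recall.
precision = 0.8928176795580111
recall = 0.8568398727465536
f1 = 2 * precision * recall / (precision + recall)
# f1 matches the reported F1-score of 0.8744588744588745 (up to rounding)
```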

This model has been compared with generative models such as Llama2 and Hermes on the test split of the benchmark. Below we present the results for partial matches, i.e., cases where a prediction is contained in the corpus text.
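The card does not include the evaluation script, but a partial-match criterion of this kind could be sketched as follows (a hypothetical illustration, not the authors' exact code; the mention strings below are made up):

```python
def partial_match_scores(predictions, gold_mentions):
    """Hypothetical sketch of a partial-match criterion: a prediction
    counts as correct if it is contained in some gold mention, and a
    gold mention counts as recalled if some prediction is contained
    in it. Not the authors' actual evaluation script."""
    tp_pred = sum(any(p in g for g in gold_mentions) for p in predictions)
    tp_gold = sum(any(p in g for p in predictions) for g in gold_mentions)
    precision = tp_pred / len(predictions) if predictions else 0.0
    recall = tp_gold / len(gold_mentions) if gold_mentions else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1

# Toy example with made-up software mentions:
preds = ["scikit-learn", "numpy", "fooLib"]
gold = ["scikit-learn 0.24", "numpy", "pandas"]
p, r, f = partial_match_scores(preds, gold)  # 2 of 3 predictions match
```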

### Llama2 (7B)

* Precision: 0.6342857142857142
* Recall: 0.7161290322580646
* F1-score: 0.67

### Hermes (13B)

* Precision: 0.4666666666666667
* Recall: 0.509090909090909
* F1-score: 0.4869565217391304

## Acknowledgements

This work has been done thanks to the effort of other projects: