rudyvdbrink
/

Llama-3.2-1B-binary-citation-classifier

PEFT

Safetensors

Generated from Trainer

Model card Files Files and versions Community

rudyvdbrink commited on Jul 17

Commit

1da5bb6

verified ·

1 Parent(s): 35eb331

Llama-3.2-1B-binary-citation-classifier

Browse files

Files changed (2) hide show

README.md +27 -24
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,18 +1,18 @@
----
-library_name: peft
-license: llama3.2
-base_model: meta-llama/Llama-3.2-1B
-tags:
-- generated_from_trainer
-metrics:
-- accuracy
-- f1
-- precision
-- recall
-model-index:
-- name: Llama-3.2-1B-binary-citation-classifier
-  results: []
----
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5812
-- Accuracy: 0.72
-- F1: 0.7200
-- Precision: 0.7200
-- Recall: 0.72
 ## Model description
@@ -52,16 +52,19 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 0.6699        | 1.0   | 500  | 0.6119          | 0.694    | 0.6940 | 0.6941    | 0.694  |
-| 0.5799        | 2.0   | 1000 | 0.5651          | 0.725    | 0.7250 | 0.7250    | 0.725  |
-| 0.5734        | 3.0   | 1500 | 0.5546          | 0.733    | 0.7330 | 0.7331    | 0.733  |
 ### Framework versions

+---
+library_name: peft
+license: llama3.2
+base_model: meta-llama/Llama-3.2-1B
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+- f1
+- precision
+- recall
+model-index:
+- name: Llama-3.2-1B-binary-citation-classifier
+  results: []
+---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5450
+- Accuracy: 0.746
+- F1: 0.7460
+- Precision: 0.7460
+- Recall: 0.746
 ## Model description
 - total_train_batch_size: 32
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 6
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 0.6249        | 1.0   | 500  | 0.5853          | 0.716    | 0.7160 | 0.7161    | 0.716  |
+| 0.5585        | 2.0   | 1000 | 0.5523          | 0.748    | 0.7478 | 0.7487    | 0.748  |
+| 0.6066        | 3.0   | 1500 | 0.5303          | 0.7535   | 0.7535 | 0.7535    | 0.7535 |
+| 0.5447        | 4.0   | 2000 | 0.5202          | 0.761    | 0.7609 | 0.7615    | 0.761  |
+| 0.4709        | 5.0   | 2500 | 0.5168          | 0.7645   | 0.7645 | 0.7645    | 0.7645 |
+| 0.5002        | 6.0   | 3000 | 0.5137          | 0.7695   | 0.7695 | 0.7696    | 0.7695 |
 ### Framework versions

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bfbcec37d65313a926c5cac80a3cdee3030727f174302811becde881321b6367
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:bd5d0c5f7f5df9e64299322be4021424aef70153ac712c58db18e41ca030e619
 size 5304