tyzhu/lmind_hotpot_train8000_eval7405_v1_recite_qa_Qwen_Qwen1.5-4B_lora2


tyzhu committed on Jun 6

Commit 0b5ada5 • 1 Parent(s): e894d4e

Model save


Files changed (2)

README.md +13 -25
training_args.bin +1 -1

README.md CHANGED

@@ -3,23 +3,11 @@ license: other
 base_model: Qwen/Qwen1.5-4B
 tags:
 - generated_from_trainer
-datasets:
-- tyzhu/lmind_hotpot_train8000_eval7405_v1_recite_qa
 metrics:
 - accuracy
 model-index:
 - name: lmind_hotpot_train8000_eval7405_v1_recite_qa_Qwen_Qwen1.5-4B_lora2
-  results:
-  - task:
-      name: Causal Language Modeling
-      type: text-generation
-    dataset:
-      name: tyzhu/lmind_hotpot_train8000_eval7405_v1_recite_qa
-      type: tyzhu/lmind_hotpot_train8000_eval7405_v1_recite_qa
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.7780232896652111
 library_name: peft
 ---
@@ -28,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # lmind_hotpot_train8000_eval7405_v1_recite_qa_Qwen_Qwen1.5-4B_lora2
-This model is a fine-tuned version of [Qwen/Qwen1.5-4B](https://huggingface.co/Qwen/Qwen1.5-4B) on the tyzhu/lmind_hotpot_train8000_eval7405_v1_recite_qa dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4804
 - Accuracy: 0.7780
 ## Model description
@@ -78,16 +66,16 @@ The following hyperparameters were used during training:
 | 0.6144        | 8.0     | 8714  | 0.7404   | 0.7907          |
 | 0.5355        | 8.9998  | 9803  | 0.7469   | 0.7288          |
 | 0.4584        | 9.9977  | 10890 | 0.7531   | 0.6794          |
-| 0.413         | 10.9998 | 11979 | 0.6292   | 0.7577          |
-| 0.3731        | 11.9995 | 13068 | 0.5926   | 0.7616          |
-| 0.3423        | 12.9993 | 14157 | 0.5620   | 0.7656          |
-| 0.3185        | 14.0    | 15247 | 0.5426   | 0.7682          |
-| 0.2924        | 14.9998 | 16336 | 0.5232   | 0.7708          |
-| 0.2824        | 15.9995 | 17425 | 0.5129   | 0.7727          |
-| 0.2669        | 16.9993 | 18514 | 0.4988   | 0.7748          |
-| 0.2517        | 18.0    | 19604 | 0.4892   | 0.7762          |
-| 0.2376        | 18.9998 | 20693 | 0.4808   | 0.7773          |
-| 0.2316        | 19.9977 | 21780 | 0.4804   | 0.7780          |
 ### Framework versions

 base_model: Qwen/Qwen1.5-4B
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: lmind_hotpot_train8000_eval7405_v1_recite_qa_Qwen_Qwen1.5-4B_lora2
+  results: []
 library_name: peft
 ---
 # lmind_hotpot_train8000_eval7405_v1_recite_qa_Qwen_Qwen1.5-4B_lora2
+This model is a fine-tuned version of [Qwen/Qwen1.5-4B](https://huggingface.co/Qwen/Qwen1.5-4B) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Accuracy: 0.7780
+- Loss: 0.4804
 ## Model description
 | 0.6144        | 8.0     | 8714  | 0.7404   | 0.7907          |
 | 0.5355        | 8.9998  | 9803  | 0.7469   | 0.7288          |
 | 0.4584        | 9.9977  | 10890 | 0.7531   | 0.6794          |
+| 0.413         | 10.9998 | 11979 | 0.7577   | 0.6292          |
+| 0.3731        | 11.9995 | 13068 | 0.7616   | 0.5926          |
+| 0.3423        | 12.9993 | 14157 | 0.7656   | 0.5620          |
+| 0.3185        | 14.0    | 15247 | 0.7682   | 0.5426          |
+| 0.2924        | 14.9998 | 16336 | 0.7708   | 0.5232          |
+| 0.2824        | 15.9995 | 17425 | 0.7727   | 0.5129          |
+| 0.2669        | 16.9993 | 18514 | 0.7748   | 0.4988          |
+| 0.2517        | 18.0    | 19604 | 0.7762   | 0.4892          |
+| 0.2376        | 18.9998 | 20693 | 0.7773   | 0.4808          |
+| 0.2316        | 19.9977 | 21780 | 0.7780   | 0.4804          |
 ### Framework versions
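
In the last hunk above, the removed rows and the added rows carry the same numbers, with the last two cells of each row exchanged. A minimal Python sketch of that transformation follows. The helper name `swap_last_two_columns` is hypothetical and is not part of the Trainer or of this repository; it also normalizes cell padding, which the real rows preserve.

```python
def swap_last_two_columns(row: str) -> str:
    """Return a markdown table row with its final two cells exchanged.

    Hypothetical helper for illustration only; cell padding is normalized
    to single spaces rather than preserved.
    """
    # Drop the outer pipes, then split into trimmed cells.
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    if len(cells) < 2:
        raise ValueError("row needs at least two cells")
    cells[-2], cells[-1] = cells[-1], cells[-2]
    return "| " + " | ".join(cells) + " |"


# Row removed by this commit (step 11979).
old_row = "| 0.413         | 10.9998 | 11979 | 0.6292   | 0.7577          |"
print(swap_last_two_columns(old_row))
```

Applied to the removed row for step 11979, this yields the same cell values as the added row for that step.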

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6d5fc939c574c4e595c73c75591d26bb1e3d44da7728cfff9a563f03485212f
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c4b324b3d635603b7a499b3bae651530278432d8833a0e95b6ca0128c6aac73
 size 5176
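
The `training_args.bin` entries above are Git LFS pointer files: a `version` line, an `oid` holding the SHA-256 of the real file, and its `size` in bytes. Only the `oid` changed in this commit; the size stayed at 5176. As a rough sketch, a downloaded file could be checked against such a pointer as shown below. The function names `parse_lfs_pointer`, `make_pointer`, and `matches_pointer` are hypothetical helpers written for this example, not part of Git LFS or of the Hugging Face tooling, and the sketch assumes the pointer contains only simple `key value` lines.

```python
import hashlib

LFS_VERSION = "https://git-lfs.github.com/spec/v1"


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its key/value fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def make_pointer(payload: bytes) -> str:
    """Build pointer text for a payload, mirroring the layout shown above."""
    digest = hashlib.sha256(payload).hexdigest()
    return f"version {LFS_VERSION}\noid sha256:{digest}\nsize {len(payload)}\n"


def matches_pointer(pointer_text: str, payload: bytes) -> bool:
    """Check that a payload has the size and SHA-256 recorded in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algorithm, _, digest = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    if len(payload) != int(fields["size"]):
        return False
    return hashlib.sha256(payload).hexdigest() == digest


# Pointer text as added by this commit.
new_pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:2c4b324b3d635603b7a499b3bae651530278432d8833a0e95b6ca0128c6aac73\n"
    "size 5176\n"
)
print(parse_lfs_pointer(new_pointer)["size"])
```

To check a real download, the bytes of the local `training_args.bin` would be passed as `payload` together with the pointer text from the commit.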