knachinen commited on
Commit
91c51d3
1 Parent(s): 198b84a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +65 -0
README.md CHANGED
@@ -1,6 +1,52 @@
1
  ---
 
 
 
 
2
  library_name: peft
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
  ## Training procedure
5
 
6
 
@@ -15,7 +61,26 @@ The following `bitsandbytes` quantization config was used during training:
15
  - bnb_4bit_quant_type: nf4
16
  - bnb_4bit_use_double_quant: True
17
  - bnb_4bit_compute_dtype: bfloat16
 
 
 
 
 
 
 
 
 
 
 
18
  ### Framework versions
19
 
20
 
21
  - PEFT 0.5.0
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - summarization
6
  library_name: peft
7
+ datasets:
8
+ - scientific_papers
9
+ metrics:
10
+ - rouge
11
+ model-index:
12
+ - name: flan-t5-base-finetuned-arxiv
13
+ results:
14
+ - task:
15
+ type: summarization
16
+ name: Summarization
17
+ dataset:
18
+ name: scientific_papers
19
+ type: scientific_papers
20
+ args: arxiv
21
+ metrics:
22
+ - name: Rouge1
23
+ type: rouge
24
+ value: 12.032000
25
+ - name: Rouge2
26
+ type: rouge
27
+ value: 4.384100
28
+ - name: Rougel
29
+ type: rouge
30
+ value: 9.842600
31
+ - name: Rougelsum
32
+ type: rouge
33
+ value: 11.139600
34
+
35
  ---
36
+
37
+ ## flan-t5-base-finetuned-arxiv
38
+
39
+ This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the scientific_papers dataset.
40
+ It achieves the following results on the evaluation set:
41
+
42
+ - Loss: 2.485082
43
+ - Rouge1: 12.032000
44
+ - Rouge2: 4.38100
45
+ - Rougel: 9.842600
46
+ - Rougelsum: 11.139600
47
+ - Gen Len: 19.000000
48
+
49
+
50
  ## Training procedure
51
 
52
 
 
61
  - bnb_4bit_quant_type: nf4
62
  - bnb_4bit_use_double_quant: True
63
  - bnb_4bit_compute_dtype: bfloat16
64
+
65
+ ### Training hyperparameters
66
+
67
+ The following hyperparameters were used during training:
68
+ - learning_rate: 2e-4
69
+ - weight_decay: 0.01
70
+ - train_batch_size: 32
71
+ - optimizer: paged_adamw_8bit (8-bit adam optimization)
72
+ - num_epochs: 1.2
73
+ - fp16: False
74
+
75
  ### Framework versions
76
 
77
 
78
  - PEFT 0.5.0
79
+ - Transformers 4.35.0
80
+ - Pytorch 1.10.1+cu111
81
+ - Datasets 2.14.7
82
+ - Tokenizers 0.14.1
83
+ - bitsandbytes 0.41.2.post2
84
+ - accelerate 0.24.0
85
+ - evaluate 0.4.1
86
+ - rouge-score 0.1.2