pszemraj commited on
Commit
4eca727
·
1 Parent(s): 7e5adc6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +74 -41
README.md CHANGED
@@ -7,60 +7,93 @@ metrics:
7
  model-index:
8
  - name: multi-emails-hq-pythia-410m-deduped-r1
9
  results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
- should probably proofread and complete it, then remove this comment. -->
14
 
15
- # multi-emails-hq-pythia-410m-deduped-r1
16
 
17
  This model is a fine-tuned version of [EleutherAI/pythia-410m-deduped](https://huggingface.co/EleutherAI/pythia-410m-deduped) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
  - Loss: 2.1018
20
  - Accuracy: 0.6157
 
21
 
 
22
  ## Model description
23
 
24
- More information needed
 
25
 
26
- ## Intended uses & limitations
27
 
28
- More information needed
29
-
30
- ## Training and evaluation data
31
-
32
- More information needed
33
-
34
- ## Training procedure
35
-
36
- ### Training hyperparameters
37
-
38
- The following hyperparameters were used during training:
39
- - learning_rate: 0.0002
40
- - train_batch_size: 8
41
- - eval_batch_size: 2
42
- - seed: 69
43
- - distributed_type: multi-GPU
44
- - gradient_accumulation_steps: 16
45
- - total_train_batch_size: 128
46
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
- - lr_scheduler_type: cosine
48
- - lr_scheduler_warmup_ratio: 0.05
49
- - num_epochs: 4.0
50
-
51
- ### Training results
52
-
53
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
54
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
55
- | 2.3067 | 1.0 | 38 | 2.3559 | 0.5594 |
56
- | 1.9243 | 2.0 | 76 | 2.1283 | 0.5975 |
57
- | 1.5971 | 3.0 | 114 | 2.0759 | 0.6140 |
58
- | 1.4139 | 4.0 | 152 | 2.1018 | 0.6157 |
59
 
 
60
 
61
- ### Framework versions
62
 
63
- - Transformers 4.27.0.dev0
64
- - Pytorch 2.0.0.dev20230130+cu118
65
- - Datasets 2.9.0
66
- - Tokenizers 0.13.2
 
7
  model-index:
8
  - name: multi-emails-hq-pythia-410m-deduped-r1
9
  results: []
10
+ widget:
11
+ - text: >-
12
+ Good Morning Professor Beans,
13
+
14
+ Hope you are doing well. I just wanted to reach out and ask if
15
+ differential calculus will be on the exam
16
+ example_title: email to prof
17
+ - text: >-
18
+ Hey <NAME>,
19
+
20
+
21
+ Thank you for signing up for my weekly newsletter. Before we get started,
22
+ you'll have to confirm your email address.
23
+ example_title: newsletter
24
+ - text: >-
25
+ Hi <NAME>,
26
+
27
+
28
+ I hope this email finds you well. I wanted to reach out and ask about
29
+ office hours
30
+ example_title: office hours
31
+ - text: >-
32
+ Greetings <NAME>,
33
+
34
+
35
+ I hope you had a splendid evening at the Company sausage eating festival.
36
+ I am reaching out because
37
+ example_title: festival
38
+ - text: |-
39
+ Good Morning Harold,
40
+
41
+ I was wondering when the next
42
+ example_title: event
43
+ - text: URGENT - I need the TPS reports
44
+ example_title: URGENT
45
+ - text: |-
46
+ Hi Archibald,
47
+
48
+ I hope this email finds you extremely well.
49
+ example_title: emails that find you
50
+ - text: |-
51
+ Hello there.
52
+
53
+ I just wanted to reach out and check in to
54
+ example_title: checking in
55
+ - text: >-
56
+ Hello <NAME>,
57
+
58
+
59
+ I hope this email finds you well. I wanted to reach out and see if you've
60
+ enjoyed your time with us
61
+ example_title: work well
62
+ - text: >-
63
+ Hi <NAME>,
64
+
65
+
66
+ I hope this email finds you well. I wanted to reach out and see if we
67
+ could catch up
68
+ example_title: catch up
69
+ - text: >-
70
+ I'm <NAME> and I just moved into the area and wanted to reach out and get
71
+ some details on where I could get groceries and
72
+ example_title: grocery
73
+ datasets:
74
+ - postbot/multi-emails-hq
75
+ language:
76
+ - en
77
+ pipeline_tag: text-generation
78
  ---
79
 
 
 
80
 
81
+ # emailgen-pythia-410m-deduped
82
 
83
  This model is a fine-tuned version of [EleutherAI/pythia-410m-deduped](https://huggingface.co/EleutherAI/pythia-410m-deduped) on the None dataset.
84
  It achieves the following results on the evaluation set:
85
  - Loss: 2.1018
86
  - Accuracy: 0.6157
87
+ - perplexity: 8.181
88
 
89
+ -
90
  ## Model description
91
 
92
+ - fine-tuned on dataset of emails for 4 epochs
93
+ - intended use: "text completion" of partially written emails
94
 
 
95
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
96
 
97
+ ---
98
 
 
99