phdreg commited on
Commit
42d9102
1 Parent(s): 3ecaf04

Model save

Browse files
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: t5-small
4
+ tags:
5
+ - generated_from_trainer
6
+ datasets:
7
+ - xsum
8
+ model-index:
9
+ - name: t5-small-finetuned-xsum
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # t5-small-finetuned-xsum
17
+
18
+ This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
19
+
20
+ ## Model description
21
+
22
+ More information needed
23
+
24
+ ## Intended uses & limitations
25
+
26
+ More information needed
27
+
28
+ ## Training and evaluation data
29
+
30
+ More information needed
31
+
32
+ ## Training procedure
33
+
34
+ ### Training hyperparameters
35
+
36
+ The following hyperparameters were used during training:
37
+ - learning_rate: 2e-05
38
+ - train_batch_size: 16
39
+ - eval_batch_size: 16
40
+ - seed: 42
41
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
+ - lr_scheduler_type: linear
43
+ - num_epochs: 1
44
+ - mixed_precision_training: Native AMP
45
+
46
+ ### Training results
47
+
48
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
49
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
50
+ | No log | 1.0 | 125 | 2.8312 | 19.1332 | 3.5412 | 15.2789 | 15.3325 | 18.562 |
51
+
52
+
53
+ ### Framework versions
54
+
55
+ - Transformers 4.40.1
56
+ - Pytorch 2.2.1+cu121
57
+ - Datasets 2.19.0
58
+ - Tokenizers 0.19.1
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "decoder_start_token_id": 0,
3
+ "eos_token_id": 1,
4
+ "pad_token_id": 0,
5
+ "transformers_version": "4.40.1"
6
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0f062dc779c6d99e2ee370804b8ea6d6d680655f7f4a5d61380a6c45f4dc9e9c
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46ff5032fbb12d124fbcce693c039bef4bb02c25e2032406d7f8696c2624aa39
3
  size 242041896
runs/Apr30_19-19-02_b23ec117b5f9/events.out.tfevents.1714505004.b23ec117b5f9.2057.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e608291a47c2a67ad6999c2a4eac3f95ab2f512eec2bcc33b5b3565c463bfb18
3
+ size 5673
runs/Apr30_19-19-02_b23ec117b5f9/events.out.tfevents.1714505270.b23ec117b5f9.2057.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:586987eaec2fd1ffa3db33d01462e87c71d712085a4ca126b621bcc1fe39ea0d
3
+ size 6536
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d7f4c8b817cf342cbe01510b15fc1f807313b6fa6a24b5ad25feb2b73f20a3cd
3
  size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:08c2fb843ee6083a70dc77e4a01079df8d5fda22d208cf47f658e697d61a2996
3
  size 5176