Commit 2125e8e by Xmm
1 Parent(s): 7a490f2

Training in progress, step 500

Files changed (5)
  1. README.md +51 -0
  2. config.json +1 -1
  3. generation_config.json +13 -0
  4. pytorch_model.bin +1 -1
  5. training_args.bin +1 -1
README.md ADDED
@@ -0,0 +1,51 @@
+ ---
+ license: apache-2.0
+ tags:
+ - generated_from_trainer
+ datasets:
+ - cnn_dailymail
+ model-index:
+ - name: led-large-16384-cnn_dailymail
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # led-large-16384-cnn_dailymail
+
+ This model is a fine-tuned version of [allenai/led-base-16384](https://huggingface.co/allenai/led-base-16384) on the cnn_dailymail dataset.
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 1
+ - eval_batch_size: 1
+ - seed: 42
+ - gradient_accumulation_steps: 8
+ - total_train_batch_size: 8
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 5
+
+ ### Framework versions
+
+ - Transformers 4.30.2
+ - Pytorch 1.13.1
+ - Datasets 2.13.0
+ - Tokenizers 0.13.3
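The model card lists train_batch_size 1, gradient_accumulation_steps 8, and total_train_batch_size 8. As a rough check of how those three values relate, here is a minimal sketch. The function name `effective_batch_size` is hypothetical, and `num_devices=1` is an assumption, since the card does not state how many devices were used. The last lines also assume that "step 500" in the commit message counts optimizer updates, which the diff does not confirm.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of examples that contribute to one optimizer update."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values from the model card; num_devices=1 is an assumption.
total = effective_batch_size(per_device_batch_size=1,
                             gradient_accumulation_steps=8)
print(total)  # 8

# Assuming "step 500" counts optimizer updates, examples seen so far:
examples_seen = 500 * total
print(examples_seen)  # 4000
```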
config.json CHANGED
@@ -54,6 +54,6 @@
    "pad_token_id": 1,
    "torch_dtype": "float32",
    "transformers_version": "4.30.2",
-   "use_cache": false,
+   "use_cache": true,
    "vocab_size": 50265
  }
generation_config.json ADDED
@@ -0,0 +1,13 @@
+ {
+   "bos_token_id": 0,
+   "decoder_start_token_id": 2,
+   "early_stopping": true,
+   "eos_token_id": 2,
+   "length_penalty": 2.0,
+   "max_length": 1024,
+   "min_length": 100,
+   "no_repeat_ngram_size": 3,
+   "pad_token_id": 1,
+   "transformers_version": "4.30.2",
+   "use_cache": false
+ }
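For readers who want to inspect the generation settings added here without loading the model, the following sketch parses the same JSON with the standard library and runs a few consistency checks. The function `check_generation_config` and the specific checks are hypothetical illustrations. They are not validation performed by Transformers itself.

```python
import json

# Values copied from the generation_config.json added in this commit.
GENERATION_CONFIG_JSON = """
{
  "bos_token_id": 0,
  "decoder_start_token_id": 2,
  "early_stopping": true,
  "eos_token_id": 2,
  "length_penalty": 2.0,
  "max_length": 1024,
  "min_length": 100,
  "no_repeat_ngram_size": 3,
  "pad_token_id": 1,
  "transformers_version": "4.30.2",
  "use_cache": false
}
"""


def check_generation_config(cfg: dict) -> list:
    """Return a list of problem descriptions; an empty list means none found."""
    problems = []
    if cfg["min_length"] > cfg["max_length"]:
        problems.append("min_length exceeds max_length")
    if cfg["no_repeat_ngram_size"] < 0:
        problems.append("no_repeat_ngram_size is negative")
    if cfg["pad_token_id"] == cfg["eos_token_id"]:
        problems.append("pad_token_id equals eos_token_id")
    return problems


cfg = json.loads(GENERATION_CONFIG_JSON)
problems = check_generation_config(cfg)
print(problems or "no problems found")
```

The checks do not cover `use_cache`, which this file sets to false while config.json in the same commit sets it to true. The diff does not state which value applies at generation time.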
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7b89a7deb65b7a6bbfde4a73a672fe8c6e3d6c9a8ef4e0d8ea59df0cc3dbb6c9
+ oid sha256:324dcb4bc14f8f022cef13a10f020c0f579cacc9dc9739f861f48544428fb583
  size 647680813
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1694af1012975cf1642e3823bb01cb5deb993d196b4cbb1c3aeb8295003a3469
+ oid sha256:e12a1061293ead53bef31527b080c78678e56c883582aa5992667009ed71fd8a
  size 4091
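The two binary files above are stored as Git LFS pointers, so the diff shows only a changed `oid` while `size` stays the same. A minimal sketch of reading such a pointer and checking downloaded bytes against it follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the check assumes the pointer uses SHA-256, as both pointers in this commit do.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, oid, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that raw bytes have the size and SHA-256 digest the pointer records."""
    return (len(data) == pointer["size"]
            and hashlib.sha256(data).hexdigest() == pointer["oid"])


# New pointer for pytorch_model.bin, copied from the diff above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:324dcb4bc14f8f022cef13a10f020c0f579cacc9dc9739f861f48544428fb583
size 647680813
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])  # sha256 647680813
```

For a file as large as pytorch_model.bin, hashing in chunks with `hashlib.sha256().update(...)` would avoid holding the whole file in memory. The in-memory version is kept here for brevity.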