Moreno La Quatra
commited on
Commit
·
d5659a7
1
Parent(s):
2ec63fb
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
This model is a fine-tuned version of [sshleifer/distilbart-cnn-12-6](https://huggingface.co/sshleifer/distilbart-cnn-12-6) on the BBC News Summary dataset (https://www.kaggle.com/pariza/bbc-news-summary).
|
2 |
+
|
3 |
+
The model has been generated as part of the in-lab practice of **Deep NLP course** currently held at Politecnico di Torino.
|
4 |
+
|
5 |
+
Training parameters:
|
6 |
+
- `num_train_epochs=2`
|
7 |
+
- `fp16=True`
|
8 |
+
- `per_device_train_batch_size=1`
|
9 |
+
- `warmup_steps=10`
|
10 |
+
- `weight_decay=0.01`
|