File size: 1,189 Bytes
ce15bc3
 
 
3e7fec6
7bbbbc1
1384509
e192953
ce15bc3
 
f01fdf0
ce15bc3
5280137
9ff89ad
8d6b0f8
f3e25ba
 
 
f01fdf0
 
 
0cf2f2e
f01fdf0
 
 
 
3e7fec6
9ff89ad
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
language: it
license: mit
widget:
- text: "La pasta più semplice è aglio, [MASK] e peperoncino."
- text: "Per fare la carbonara servono le [MASK]." 
- text: "A tavola non può mancare del buon [MASK]."
---

# ChefBERTo 👨‍🍳 

**chefberto-italian-cased** is a BERT model obtained by MLM adaptive-tuning [**bert-base-italian-xxl-cased**](https://huggingface.co/dbmdz/bert-base-italian-xxl-cased) on Italian cooking recipes, approximately 50k sentences (2.6M words).

**Author:** Cristiano De Nobili ([@denocris](https://twitter.com/denocris) on Twitter, [LinkedIn](https://www.linkedin.com/in/cristiano-de-nobili/)) for [VINHOOD](https://www.vinhood.com/en/).
<p>
    <img src="https://drive.google.com/uc?export=view&id=1u5aY2wKu-X5DAzbOq7rsgGFW5_lGUAQn" width="400"> </br>
</p>

# Perplexity 

Test set: 9k sentences about food.

| Model | Perplexity | 
| ------ | ------ | 
| chefberto-italian-cased | **1.84**  | 
| bert-base-italian-xxl-cased | 2.85  |

# Usage

```python
from transformers import AutoModel, AutoTokenizer
model_name = "vinhood/chefberto-italian-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)
```