File size: 1,189 Bytes
ce15bc3 3e7fec6 7bbbbc1 1384509 e192953 ce15bc3 f01fdf0 ce15bc3 5280137 9ff89ad 8d6b0f8 f3e25ba f01fdf0 0cf2f2e f01fdf0 3e7fec6 9ff89ad |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
---
language: it
license: mit
widget:
- text: "La pasta più semplice è aglio, [MASK] e peperoncino."
- text: "Per fare la carbonara servono le [MASK]."
- text: "A tavola non può mancare del buon [MASK]."
---
# ChefBERTo 👨🍳
**chefberto-italian-cased** is a BERT model obtained by MLM adaptive-tuning [**bert-base-italian-xxl-cased**](https://huggingface.co./dbmdz/bert-base-italian-xxl-cased) on Italian cooking recipes, approximately 50k sentences (2.6M words).
**Author:** Cristiano De Nobili ([@denocris](https://twitter.com/denocris) on Twitter, [LinkedIn](https://www.linkedin.com/in/cristiano-de-nobili/)) for [VINHOOD](https://www.vinhood.com/en/).
<p>
<img src="https://drive.google.com/uc?export=view&id=1u5aY2wKu-X5DAzbOq7rsgGFW5_lGUAQn" width="400"> </br>
</p>
# Perplexity
Test set: 9k sentences about food.
| Model | Perplexity |
| ------ | ------ |
| chefberto-italian-cased | **1.84** |
| bert-base-italian-xxl-cased | 2.85 |
# Usage
```python
from transformers import AutoModel, AutoTokenizer
model_name = "vinhood/chefberto-italian-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)
``` |