Silvia Terragni committed
Commit 931590a (1 parent: dd625f5)
Create README.md
README.md
ADDED
@@ -0,0 +1,53 @@
---
language: it
license:
datasets:
- wit
- ctl/conceptualCaptions
- mscoco-it
tags:
- italian
- bert
- vit
- vision
---

# CLIP-Italian
CLIP-Italian is a CLIP-like model for Italian. CLIP (Contrastive Language–Image Pre-training), developed by researchers at OpenAI, efficiently learns visual concepts from natural language supervision.

We fine-tuned a competitive Italian CLIP model with only ~1.4 million Italian image-text pairs. This model is part of the [Flax/Jax Community Week](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104), organized by [HuggingFace](https://huggingface.co/) with TPU usage sponsored by Google.

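To make the "contrastive" part of the name concrete, here is a minimal sketch of the symmetric image-text loss described in the CLIP paper, written in plain Python for readability. It is not the training code used for this model: the function names (`l2_normalize`, `log_softmax`, `clip_contrastive_loss`) are hypothetical, the embeddings are toy lists rather than encoder outputs, and the fixed `temperature=0.07` is an assumption (CLIP learns this value during training).

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    top = max(row)
    log_z = top + math.log(sum(math.exp(x - top) for x in row))
    return [x - log_z for x in row]


def clip_contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric cross-entropy over pairwise image-text cosine similarities.

    Row i of image_embs and row i of text_embs are assumed to be a matching pair.
    The temperature value is an illustrative assumption, not a tuned setting.
    """
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    n = len(images)

    # logits[i][j]: temperature-scaled cosine similarity of image i and text j.
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in texts]
        for img in images
    ]

    # Image-to-text direction: the correct caption for image i is on the diagonal.
    loss_i2t = -sum(log_softmax(logits[i])[i] for i in range(n)) / n

    # Text-to-image direction: same computation over the columns.
    columns = [list(col) for col in zip(*logits)]
    loss_t2i = -sum(log_softmax(columns[j])[j] for j in range(n)) / n

    return (loss_i2t + loss_t2i) / 2


if __name__ == "__main__":
    matched = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
    swapped = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
    print(f"matched pairs: {matched:.6f}, swapped pairs: {swapped:.6f}")
```

The loss is small when each image is most similar to its own caption and large when the pairs are mismatched, which is the signal that pulls matching image and text embeddings together during training.
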
## Training Data
We considered three main sources of data:
- [WIT](https://github.com/google-research-datasets/wit)
- [MSCOCO-IT](https://github.com/crux82/mscoco-it)
- [Conceptual Captions](https://ai.google.com/research/ConceptualCaptions/)

## Training Procedure
Preprocessing, hardware used, hyperparameters...

## Evaluation Performance


## Limitations


## Usage


## Team members
- Federico Bianchi ([vinid](https://huggingface.co/vinid))
- Raphael Pisoni ([4rtemi5](https://huggingface.co/4rtemi5))
- Giuseppe Attanasio ([g8a9](https://huggingface.co/g8a9))
- Silvia Terragni ([silviatti](https://huggingface.co/silviatti))
- Dario Balestri ([D3Reo](https://huggingface.co/D3Reo))
- Gabriele Sarti ([gsarti](https://huggingface.co/gsarti))
- Sri Lakshmi ([srisweet](https://huggingface.co/srisweet))

## Useful links
- [CLIP Blog post](https://openai.com/blog/clip/)
- [CLIP paper](https://arxiv.org/abs/2103.00020)
- [Community Week README](https://github.com/huggingface/transformers/blob/master/examples/research_projects/jax-projects/README.md)
- [Community Week channel](https://discord.com/channels/858019234139602994/859711887520038933)
- [Hybrid CLIP example scripts](https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects/hybrid_clip)
- [Model Repository](https://huggingface.co/clip-italian/clip-italian-final/)