ai-forever commited on
Commit
ab562b7
1 Parent(s): 2ffd0ad

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +64 -0
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ruclip-vit-base-patch16-384
2
+
3
+ **RuCLIP** (**Ru**ssian **C**ontrastive **L**anguage–**I**mage **P**retraining) is a multimodal model
4
+ for obtaining images and text similarities and rearranging captions and pictures.
5
+ RuCLIP builds on a large body of work on zero-shot transfer, computer vision, natural language processing and
6
+ multimodal learning.
7
+
8
+ Model was trained by [Sber AI](https://github.com/sberbank-ai) and [SberDevices](https://sberdevices.ru/) teams.
9
+ * Task: `text ranking`; `image ranking`; `zero-shot image classification`;
10
+ * Type: `encoder`
11
+ * Num Parameters: `150M`
12
+ * Training Data Volume: `240 million text-image pairs`
13
+ * Language: `Russian`
14
+ * Context Length: `77`
15
+ * Transformer Layers: `12`
16
+ * Transformer Width: `512`
17
+ * Transformer Heads: `8`
18
+ * Image Size: `384`
19
+ * Vision Layers: `12`
20
+ * Vision Width: `768`
21
+ * Vision Patch Size: `16`
22
+
23
+ ## Usage [Github](https://github.com/sberbank-ai/ru-clip)
24
+
25
+ ```
26
+ pip install ruclip
27
+ ```
28
+
29
+ ```python
30
+ clip, processor = ruclip.load("ruclip-vit-base-patch16-384", device="cuda")
31
+ ```
32
+
33
+
34
+ ## Performance
35
+ We have evaluated the performance on the following datasets:
36
+
37
+ | Dataset | Metric Name | Metric Result |
38
+ |:--------------|:---------------|:--------------------|
39
+ | Food101 | acc | 0.689 |
40
+ | CIFAR10 | acc | 0.845 |
41
+ | CIFAR100 | acc | 0.569 |
42
+ | Birdsnap | acc | 0.195 |
43
+ | SUN397 | acc | 0.521 |
44
+ | Stanford Cars | acc | 0.626 |
45
+ | DTD | acc | 0.421 |
46
+ | MNIST | acc | 0.478 |
47
+ | STL10 | acc | 0.964 |
48
+ | PCam | acc | 0.501 |
49
+ | CLEVR | acc | 0.132 |
50
+ | Rendered SST2 | acc | 0.525 |
51
+ | ImageNet | acc | 0.482 |
52
+ | FGVC Aircraft | mean-per-class | 0.046 |
53
+ | Oxford Pets | mean-per-class | 0.635 |
54
+ | Caltech101 | mean-per-class | 0.835 |
55
+ | Flowers102 | mean-per-class | 0.452 |
56
+ | HatefulMemes | roc-auc | 0.543 |
57
+
58
+
59
+ # Authors
60
+
61
+ + Alex Shonenkov: [Github](https://github.com/shonenkov), [Kaggle GM](https://www.kaggle.com/shonenkov)
62
+ + Daniil Chesakov: [Github](https://github.com/Danyache)
63
+ + Denis Dimitrov: [Github](https://github.com/denndimitrov)
64
+ + Igor Pavlov: [Github](https://github.com/boomb0om)