zuppif commited on
Commit
1042d4c
1 Parent(s): 66b5e6b

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +65 -0
README.md ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - vision
5
+ - image-classification
6
+
7
+ datasets:
8
+ - imagenet-1k
9
+
10
+ widgets:
11
+ - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/tiger.jpg
12
+ example_title: Tiger
13
+ - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/teapot.jpg
14
+ example_title: Teapot
15
+ - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/palace.jpg
16
+ example_title: Palace
17
+
18
+ ---
19
+
20
+ # Van
21
+
22
+ Van model trained on imagenet-1k. It was introduced in the paper [Visual Attention Network](https://arxiv.org/abs/2202.09741) and first released in [this repository](https://github.com/Visual-Attention-Network/VAN-Classification).
23
+
24
+ Disclaimer: The team releasing Van did not write a model card for this model so this model card has been written by the Hugging Face team.
25
+
26
+ ## Model description
27
+
28
+ This paper introduces a new attention layer based on convolution operations able to capture both local and distant relationships. This is done by combining normal and large kernel convolution layers. The latter uses a dilated convolution to capture distant correlations.
29
+
30
+ ![model image](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/van_architecture.png)
31
+
32
+ ## Intended uses & limitations
33
+
34
+ You can use the raw model for image classification. See the [model hub](https://huggingface.co/models?search=van) to look for
35
+ fine-tuned versions on a task that interests you.
36
+
37
+ ### How to use
38
+
39
+ Here is how to use this model:
40
+
41
+ ```python
42
+ >>> from transformers import AutoFeatureExtractor, VanForImageClassification
43
+ >>> import torch
44
+ >>> from datasets import load_dataset
45
+
46
+ >>> dataset = load_dataset("huggingface/cats-image")
47
+ >>> image = dataset["test"]["image"][0]
48
+
49
+ >>> feature_extractor = AutoFeatureExtractor.from_pretrained("Visual-Attention-Network/van-base")
50
+ >>> model = VanForImageClassification.from_pretrained("Visual-Attention-Network/van-base")
51
+
52
+ >>> inputs = feature_extractor(image, return_tensors="pt")
53
+
54
+ >>> with torch.no_grad():
55
+ ... logits = model(**inputs).logits
56
+
57
+ >>> # model predicts one of the 1000 ImageNet classes
58
+ >>> predicted_label = logits.argmax(-1).item()
59
+ >>> print(model.config.id2label[predicted_label])
60
+ tabby, tabby cat
61
+ ```
62
+
63
+
64
+
65
+ For more code examples, we refer to the [documentation](https://huggingface.co/docs/transformers/master/en/model_doc/van).