Commit 625be1c
Parent(s): 42a50d2
Update README.md
README.md CHANGED
@@ -71,4 +71,24 @@ python -W ignore run_clip.py --model_name_or_path openai/clip-vit-large-patch14
 --logging_dir ./pmc_vit_logs \
 --save_total_limit 2 \
 --report_to tensorboard
+```
+
+### usage
+```python
+from PIL import Image
+import requests
+
+from transformers import CLIPProcessor, CLIPModel
+
+model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")
+processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")
+
+url = "http://images.cocodataset.org/val2017/000000039769.jpg"
+image = Image.open(requests.get(url, stream=True).raw)
+
+inputs = processor(text=["a photo of a cat", "a photo of a dog"], images=image, return_tensors="pt", padding=True)
+
+outputs = model(**inputs)
+logits_per_image = outputs.logits_per_image # this is the image-text similarity score
+probs = logits_per_image.softmax(dim=1) # we can take the softmax to get the label probabilities
 ```
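
For reference, the snippet this commit adds is the standard zero-shot image–text matching example from the transformers CLIP documentation: `probs` is a `(1, 2)` torch tensor holding the probability that the image matches each candidate caption. A minimal sketch of reading off the prediction follows; it is an illustrative continuation, not part of the commit, and the `labels` list and print format are assumptions that simply mirror the captions passed to the processor:

```python
# Illustrative continuation of the added snippet (not part of the commit).
# probs has shape (1, 2): one row per image, one column per candidate caption.
labels = ["a photo of a cat", "a photo of a dog"]
best = probs.argmax(dim=1).item()  # index of the highest-probability caption
print(f"predicted: {labels[best]} (p={probs[0, best].item():.3f})")
```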