NP-NP/pokemon_model

Image Classification

Transformers

Safetensors

vit

Inference Endpoints

NP-NP committed on Dec 6, 2024

Commit a721254 (verified) · 1 Parent(s): cb87e23

Update README.md

Files changed (1)

README.md +16 -19

@@ -1,60 +1,56 @@
 ---
 library_name: transformers
 base_model:
-- google/vit-base-patch16-224
 ---
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
-VIT model used for Pokemon type classification, also used in my CS 310 Final Project
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been trained on Pokemon image dataset to classify the all 18 types of the Pokemon.
-- **Developed by:** Xianglu(Steven) Zhu
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** Vision Transformer for Regression
-## How to Get Started with the Model
-Use the code below to get started with the model
 import torch
 from PIL import Image
 import torchvision.transforms as transforms
 from transformers import ViTForImageClassification, ViTFeatureExtractor
 hf_model = ViTForImageClassification.from_pretrained("NP-NP/pokemon_model")
 hf_feature_extractor = ViTFeatureExtractor.from_pretrained("NP-NP/pokemon_model")
 transform = transforms.Compose([
     transforms.Resize((224, 224)),
     transforms.ToTensor(),
     transforms.Normalize(mean=hf_feature_extractor.image_mean, std=hf_feature_extractor.image_std)
 ])
 labels_dict = {
     'Grass': 0, 'Fire': 1, 'Water': 2, 'Bug': 3, 'Normal': 4, 'Poison': 5, 'Electric': 6,
     'Ground': 7, 'Fairy': 8, 'Fighting': 9, 'Psychic': 10, 'Rock': 11, 'Ghost': 12,
     'Ice': 13, 'Dragon': 14, 'Dark': 15, 'Steel': 16, 'Flying': 17
 }
 idx_to_label = {v: k for k, v in labels_dict.items()}
 image_path = "cute-pikachu-flowers-pokemon-desktop-wallpaper.jpg"
 image = Image.open(image_path).convert("RGB")
 input_tensor = transform(image).unsqueeze(0)  # shape: (1, 3, 224, 224)
 hf_model.eval()
 with torch.no_grad():
     outputs = hf_model(input_tensor)
@@ -63,6 +59,7 @@ with torch.no_grad():
 predicted_class = idx_to_label[predicted_class_idx]
 print("Predicted Pokémon type:", predicted_class)

 ---
 library_name: transformers
 base_model:
+  - google/vit-base-patch16-224
 ---
+# Model Card for Pokémon Type Classification
+This model leverages a Vision Transformer (ViT) to classify Pokémon images into 18 different types.
+It was developed as part of the CS 310 Final Project and trained on a Pokémon image dataset.
 ## Model Details
+- **Developer:** Xianglu (Steven) Zhu
+- **Purpose:** Pokémon type classification
+- **Model Type:** Vision Transformer (ViT) for image classification
+## Getting Started
+Here’s how you can use the model for classification:
+```python
 import torch
 from PIL import Image
 import torchvision.transforms as transforms
 from transformers import ViTForImageClassification, ViTFeatureExtractor
+# Load the pretrained model and feature extractor
 hf_model = ViTForImageClassification.from_pretrained("NP-NP/pokemon_model")
 hf_feature_extractor = ViTFeatureExtractor.from_pretrained("NP-NP/pokemon_model")
+# Define preprocessing transformations
 transform = transforms.Compose([
     transforms.Resize((224, 224)),
     transforms.ToTensor(),
     transforms.Normalize(mean=hf_feature_extractor.image_mean, std=hf_feature_extractor.image_std)
 ])
+# Mapping of labels to indices and vice versa
 labels_dict = {
     'Grass': 0, 'Fire': 1, 'Water': 2, 'Bug': 3, 'Normal': 4, 'Poison': 5, 'Electric': 6,
     'Ground': 7, 'Fairy': 8, 'Fighting': 9, 'Psychic': 10, 'Rock': 11, 'Ghost': 12,
     'Ice': 13, 'Dragon': 14, 'Dark': 15, 'Steel': 16, 'Flying': 17
 }
 idx_to_label = {v: k for k, v in labels_dict.items()}
+# Load and preprocess the image
 image_path = "cute-pikachu-flowers-pokemon-desktop-wallpaper.jpg"
 image = Image.open(image_path).convert("RGB")
 input_tensor = transform(image).unsqueeze(0)  # shape: (1, 3, 224, 224)
+# Make a prediction
 hf_model.eval()
 with torch.no_grad():
     outputs = hf_model(input_tensor)
@@ -63,6 +59,7 @@ with torch.no_grad():
 predicted_class = idx_to_label[predicted_class_idx]
 print("Predicted Pokémon type:", predicted_class)
+```
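
The diff omits the unchanged lines between its two hunks, which is where `predicted_class_idx` gets its value. The sketch below shows one plausible way to do that step, assuming the index is simply the position of the largest logit. The helper `predict_label` and the `example_logits` values are hypothetical and used only for illustration; the label order is copied from the README. It uses plain Python lists so it runs without downloading the model.

```python
# Label order copied from the README's labels_dict.
LABELS = [
    'Grass', 'Fire', 'Water', 'Bug', 'Normal', 'Poison', 'Electric',
    'Ground', 'Fairy', 'Fighting', 'Psychic', 'Rock', 'Ghost',
    'Ice', 'Dragon', 'Dark', 'Steel', 'Flying',
]
labels_dict = {name: idx for idx, name in enumerate(LABELS)}
idx_to_label = {idx: name for name, idx in labels_dict.items()}


def predict_label(logits, idx_to_label):
    """Hypothetical helper: return the label with the largest logit."""
    if len(logits) != len(idx_to_label):
        raise ValueError("expected one logit per label")
    # Plain-Python argmax over the class dimension.
    predicted_class_idx = max(range(len(logits)), key=lambda i: logits[i])
    return idx_to_label[predicted_class_idx]


# Made-up logits for illustration only, not real model output.
example_logits = [0.0] * len(LABELS)
example_logits[labels_dict['Electric']] = 3.2
print("Predicted Pokémon type:", predict_label(example_logits, idx_to_label))
```

With the real model, the equivalent step would likely be `predicted_class_idx = outputs.logits.argmax(-1).item()`, since `ViTForImageClassification` returns an output object with a `logits` tensor of shape `(batch_size, num_labels)`. Whether the hidden lines in this commit do exactly that is an assumption.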