BVRA
/

MegaDescriptor-L-384

Image Classification

wildlife-datasets

re-identification

Model card Files Files and versions Community

cermakvo commited on Dec 22, 2023

Commit

6e11691

•

1 Parent(s): 4221c4c

Update README.md

Files changed (1) hide show

README.md +6 -5

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ tags:
 library_name: wildlife-datasets
 license: cc-by-nc-4.0
 ---
-# Model card for MegaDescriptor-B-224
 A Swin-L image feature model. Superwisely pre-trained on animal re-identification datasets.
@@ -15,12 +15,13 @@ A Swin-L image feature model. Superwisely pre-trained on animal re-identificatio
 ## Model Details
 - **Model Type:** Animal re-identification / feature backbone
 - **Model Stats:**
-  - Params (M): ??
   - Image size: 384 x 384
 - **Papers:**
   - Swin Transformer: Hierarchical Vision Transformer using Shifted Windows --> https://arxiv.org/abs/2103.14030
 - **Original:** ??
-- **Pretrain Dataset:** All available re-identification datasets --> TBD
 ## Model Usage
 ### Image Embeddings
@@ -33,10 +34,10 @@ import torchvision.transforms as T
 from PIL import Image
 from urllib.request import urlopen
-model = timm.create_model("hf-hub:BVRA/wildlife-mega", pretrained=True)
 model = model.eval()
-train_transforms = T.Compose([T.Resize(224),
                               T.ToTensor(),
                               T.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5])])

 library_name: wildlife-datasets
 license: cc-by-nc-4.0
 ---
+# Model card for MegaDescriptor-L-384
 A Swin-L image feature model. Superwisely pre-trained on animal re-identification datasets.
 ## Model Details
 - **Model Type:** Animal re-identification / feature backbone
 - **Model Stats:**
+  - Params (M): 228.8
   - Image size: 384 x 384
+  - Architecture: swin_large_patch4_window12_384
 - **Papers:**
   - Swin Transformer: Hierarchical Vision Transformer using Shifted Windows --> https://arxiv.org/abs/2103.14030
 - **Original:** ??
+- **Pretrain Dataset:** All available re-identification datasets --> https://github.com/WildlifeDatasets/wildlife-datasets
 ## Model Usage
 ### Image Embeddings
 from PIL import Image
 from urllib.request import urlopen
+model = timm.create_model("hf-hub:BVRA/MegaDescriptor-L-384", pretrained=True)
 model = model.eval()
+train_transforms = T.Compose([T.Resize(size=(384, 384)),
                               T.ToTensor(),
                               T.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5])])