	---
	tags:
	- clip
	library_name: open_clip
	pipeline_tag: zero-shot-image-classification
	license: mit
	---
	# Model card for taxabind-vit-b-16

	## Paper: TaxaBind: A Unified Embedding Space for Ecological Applications
	## Venue: WACV 2025
	## GitHub: https://github.com/mvrl/TaxaBind

	## TaxaBind

	TaxaBind is a multimodal embedding space consisting of six modalities. This checkpoint contains the image and text modalities in `open_clip` format. It can be used for zero-shot classification of species images against taxonomic text classes.

	## Usage

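	```python
	import open_clip

	# Download the checkpoint from the Hugging Face Hub and build the model,
	# together with its training and validation image transforms.
	model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:MVRL/taxabind-vit-b-16')
	# Load the text tokenizer that matches this checkpoint.
	tokenizer = open_clip.get_tokenizer('hf-hub:MVRL/taxabind-vit-b-16')
	```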
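
The snippet above only loads the model. As a minimal sketch of the zero-shot classification step, the helper below follows the general pattern from the `open_clip` documentation: encode the image and the candidate labels, L2-normalize both embeddings, and take a softmax over the scaled cosine similarities. The function name `zero_shot_classify` and the fixed `logit_scale=100.0` are assumptions made for illustration, not part of the TaxaBind release. The text format the model expects for taxonomic classes is not specified here, so consult the GitHub repository for the intended prompts.

```python
import torch


def zero_shot_classify(model, preprocess, tokenizer, image, labels, logit_scale=100.0):
    """Rank candidate text labels for one image.

    Hypothetical helper: `image` is a PIL image, `labels` is a list of
    candidate text classes, and `logit_scale` is an assumed temperature.
    Returns (label, probability) pairs sorted by descending probability.
    """
    image_input = preprocess(image).unsqueeze(0)  # add a batch dimension
    text_input = tokenizer(labels)

    model.eval()
    with torch.no_grad():
        image_features = model.encode_image(image_input)
        text_features = model.encode_text(text_input)

    # L2-normalize so that the dot product equals cosine similarity.
    image_features = image_features / image_features.norm(dim=-1, keepdim=True)
    text_features = text_features / text_features.norm(dim=-1, keepdim=True)

    # Softmax over the scaled similarities gives one probability per label.
    probs = (logit_scale * image_features @ text_features.T).softmax(dim=-1)[0]
    return sorted(zip(labels, probs.tolist()), key=lambda pair: pair[1], reverse=True)
```

With the objects loaded in the usage snippet, a call might look like `zero_shot_classify(model, preprocess_val, tokenizer, Image.open("bird.jpg"), candidate_labels)`, where `Image` comes from `PIL`, `bird.jpg` is a placeholder path, and `candidate_labels` is a list of species names of your choosing. The validation transform is passed in because inference does not need training-time augmentation.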