tanganke
/

clip-vit-large-patch14_sun397

Feature Extraction

clip_vision_model

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Model Card

Model Details

Architecture: ViT-Large with patch size 14
Training Data: SUN397 dataset

Training Details

Adam Optimizer with a constant learning rate 1e-5 for 4000 steps training (batch_size=32). Only the vision encoder is fine-tuned.

Evaluation Results

pre-trained: 0.6830110549926758
fine-tuned: 0.8275973796844482

Downloads last month: 68

Safetensors

Model size

303M params

Tensor type

F32

·

Finetuned from

Dataset used to train tanganke/clip-vit-large-patch14_sun397

Collection including tanganke/clip-vit-large-patch14_sun397

CLIP-ViT-L/14 on the eight image classification tasks

8 items • Updated 10 days ago