---
datasets:
- imagenet-1k
pipeline_tag: image-classification
---
## Model Architecture Details
### Architecture Overview
- **Architecture**: ViT Small
### Configuration
| Attribute | Value |
|----------------------|----------------|
| Patch Size | 32 |
| Image Size | 224 |
| Num Layers | 3 |
| Attention Heads | 4 |
| Objective Function | CrossEntropy |
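
As a rough illustration of what the patch and image sizes in the table imply, the sketch below computes the patch grid for a square input. The helper name `patch_grid` is hypothetical and not part of ViT-Prisma. Whether the model prepends a class token (which would make the sequence one token longer) is not stated in this card, so the sketch counts image patches only.

```python
# Values taken from the configuration table above.
IMAGE_SIZE = 224
PATCH_SIZE = 32


def patch_grid(image_size: int, patch_size: int) -> tuple[int, int]:
    """Return (patches per side, total patches) for a square image.

    Hypothetical helper for illustration; not a ViT-Prisma function.
    """
    if image_size % patch_size != 0:
        raise ValueError("image size must be divisible by patch size")
    per_side = image_size // patch_size
    return per_side, per_side * per_side


per_side, num_patches = patch_grid(IMAGE_SIZE, PATCH_SIZE)
print(per_side, num_patches)  # 7 49
```

With a patch size of 32, a 224-pixel image splits into a 7 x 7 grid, so each of the 3 layers attends over a fairly short sequence of 49 patch tokens.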
### Performance
- **Validation Accuracy (Top 5)**: 0.4179
- **Validation Accuracy (Top 1)**: 0.2148
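
For readers unfamiliar with the metric, here is a minimal pure-Python sketch of how a top-k accuracy can be computed, assuming the unqualified figure above is the usual top-1 accuracy. The function name `top_k_accuracy` and the toy scores are hypothetical; this is not the evaluation code used for the numbers reported here.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of examples whose true label is among the k highest scores."""
    hits = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores in this row.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Toy data (hypothetical, not from the model): 3 examples, 4 classes.
scores = [
    [0.1, 0.6, 0.2, 0.1],
    [0.5, 0.1, 0.3, 0.1],
    [0.3, 0.1, 0.1, 0.5],
]
labels = [1, 2, 0]

top1 = top_k_accuracy(scores, labels, 1)
top2 = top_k_accuracy(scores, labels, 2)
print(top1, top2)
```

Top-k accuracy can only grow as k increases, which is why the top-5 figure is higher than the other one.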
### Additional Resources
The model was trained with the [ViT-Prisma](https://github.com/soniajoseph/ViT-Prisma) library.\
For detailed metrics, plots, and further analysis of the model's training process, refer to the [training report](https://wandb.ai/perceptual-alignment/Imagenet/reports/ViT-Small-Imagenet-training-report--Vmlldzo3MDk3MTM5).