File size: 914 Bytes
7fb3833
 
 
 
f095cf5
 
 
 
 
 
e5069f1
f095cf5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e5069f1
 
 
2c2d4fd
28e00e3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
---
datasets:
- imagenet-1k
pipeline_tag: image-classification
---

## Model Architecture Details

### Architecture Overview

- **Architecture**: ViT Small

### Configuration

| Attribute            | Value          |
|----------------------|----------------|
| Patch Size           | 32             |
| Image Size           | 224            |
| Num Layers           | 3              |
| Attention Heads      | 4              |
| Objective Function   | CrossEntropy   |

### Performance

- **Validation Accuracy (Top 5)**: 0.4179
- **Validation Accuracy**: 0.2148

### Additional Resources

The model was trained using the library: [ViT-Prisma](https://github.com/soniajoseph/ViT-Prisma).\
For detailed metrics, plots, and further analysis of the model's training process, refer to the [training report](https://wandb.ai/perceptual-alignment/Imagenet/reports/ViT-Small-Imagenet-training-report--Vmlldzo3MDk3MTM5).