Eval Results
File size: 1,199 Bytes
607c8ae
 
 
 
 
9879269
607c8ae
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
license: apache-2.0
datasets:
- ILSVRC/imagenet-1k
model-index:
  - name: MaskGIT-Tokenizer-10bits
    results:
      - task:
          type: image-generation
        dataset:
          name: ILSVRC/imagenet-1k
          type: ILSVRC/imagenet-1k
        metrics:
          - name: rFID
            type: rFID
            value: 1.96
          - name: InceptionScore
            type: InceptionScore
            value: 178.3
          - name: LPIPS
            type: LPIPS
            value: 0.331
          - name: PSNR
            type: PSNR
            value: 18.6
          - name: SSIM
            type: SSIM
            value: 0.47
          - name: CodebookUsage
            type: CodebookUsage
            value: 1.0
---

This model is the MaskGIT tokenizer with a vocabulary size of 10bits adopted for the usage in the MaskBit codebase. It uses a downsampling factor of 16 and is trained on ImageNet for images of resolution 256.

You can find more details in the original [repository](https://github.com/google-research/maskgit) and in the [paper](https://arxiv.org/abs/2202.04200). All credits for this model belong to Huiwen Chang, Han Zhang, Lu Jiang, Ce Liu, and William T. Freeman.