Eval Results
maskgit_tokenizer / README.md
markweber's picture
Update README.md
9879269 verified
|
raw
history blame
1.2 kB
---
license: apache-2.0
datasets:
- ILSVRC/imagenet-1k
model-index:
- name: MaskGIT-Tokenizer-10bits
results:
- task:
type: image-generation
dataset:
name: ILSVRC/imagenet-1k
type: ILSVRC/imagenet-1k
metrics:
- name: rFID
type: rFID
value: 1.96
- name: InceptionScore
type: InceptionScore
value: 178.3
- name: LPIPS
type: LPIPS
value: 0.331
- name: PSNR
type: PSNR
value: 18.6
- name: SSIM
type: SSIM
value: 0.47
- name: CodebookUsage
type: CodebookUsage
value: 1.0
---
This model is the MaskGIT tokenizer with a vocabulary size of 10bits adopted for the usage in the MaskBit codebase. It uses a downsampling factor of 16 and is trained on ImageNet for images of resolution 256.
You can find more details in the original [repository](https://github.com/google-research/maskgit) and in the [paper](https://arxiv.org/abs/2202.04200). All credits for this model belong to Huiwen Chang, Han Zhang, Lu Jiang, Ce Liu, and William T. Freeman.