PandA / networks /genforce /MODEL_ZOO.md
james-oldfield's picture
Upload 194 files
2a76164

A newer version of the Gradio SDK is available: 4.40.0

Upgrade

Model Zoo

Pre-trained Models

First of all, we thank the following repositories for their work on high-quality image synthesis

Please download the models you need and save them to checkpoints/.

StyleGAN Official
Model (Dataset) Training Samples Training Duration (K Images) FID
ffhq-1024x1024 70,000 25,000 4.40
celebahq-1024x1024 30,000 25,000 5.06
bedroom-256x256 3,033,042 70,000 2.65
cat-256x256 1,657,266 70,000 8.53
car-512x384 5,520,756 46,000 3.27
StyleGAN Ours
Model (Dataset) Training Samples Training Duration (K Images) FID
Face ("partial" means faces are not fully aligned to center)
celeba_partial-256x256 103,706 50,000 7.03
ffhq-256x256 70,000 25,000 5.70
ffhq-512x512 70,000 25,000 5.15
LSUN Indoor Scene
livingroom-256x256 1,315,802 30,000 5.16
diningroom-256x256 657,571 25,000 4.13
kitchen-256x256 1,000,000 30,000 5.06
LSUN Indoor Scene Mixture
apartment-256x256 4 * 200,000 60,000 4.18
LSUN Outdoor Scene
church-256x256 126,227 30,000 4.82
tower-256x256 708,264 30,000 5.99
bridge-256x256 818,687 25,000 6.42
LSUN Other Scene
restaurant-256x256 626,331 50,000 4.03
classroom-256x256 168,103 50,000 10.10
conferenceroom-256x256 229,069 50,000 6.20
StyleGAN Third-Party
Model (Dataset) Source
animeface-512x512 link
animeportrait-512x512 link
artface-512x512 link
StyleGAN2 Official
Model (Dataset) Training Samples Training Duration (K Images) FID
ffhq-1024x1024 70,000 25,000 2.84
church-256x256 126,227 48,000 3.86
cat-256x256 1,657,266 88,000 6.93
horse-256x256 2,000,340 100,000 3.43
car-512x384 5,520,756 57,000 2.32

Training Datasets

  • MNIST (60,000 training samples and 10,000 test samples on 10 digital numbers)
  • SVHN (73,257 training samples, 26,032 testing samples, and 531,131 additional samples on 10 digital numbers)
  • CIFAR10 (50,000 training samples and 10,000 test samples on 10 classes)
  • CIFAR100 (50,000 training samples and 10,000 test samples on 100 classes)
  • ImageNet (1,281,167 training samples, 50,000 validation samples, and 100,100 testing samples on 1000 classes)
  • CelebA (202,599 samples from 10,177 identities, with 5 landmarks and 40 binary facial attributes)
  • CelebA-HQ (30,000 samples)
  • FF-HQ (70,000 samples)
  • LSUN (see statistical information below)
  • Places (around 1.8M training samples covering 365 classes)
  • Cityscapes (2,975 training samples, 19998 extra training samples (one broken), 500 validation samples, and 1,525 test samples)
  • Streetscapes

Statistical information of LSUN dataset is summarized as follows:

LSUN Datasets Stats
Name Number of Samples Size
Scenes
bedroom (train) 3,033,042 43G
bridge (train) 818,687 15G
churchoutdoor (train) 126,227 2G
classroom (train) 168,103 3G
conferenceroom (train) 229,069 4G
diningroom (train) 657,571 11G
kitchen (train) 2,212,277 33G
livingroom (train) 1,315,802 21G
restaurant (train) 626,331 13G
tower (train) 708,264 11G
Objects
airplane 1,530,696 34G
bicycle 3,347,211 129G
bird 2,310,362 65G
boat 2,651,165 86G
bottle 3,202,760 64G
bus 695,891 24G
car 5,520,756 173G
cat 1,657,266 42G
chair 5,037,807 116G
cow 377,379 15G
diningtable 1,537,123 48G
dog 5,054,817 145G
horse 2,000,340 69G
motorbike 1,194,101 42G
person 18,890,816 477G
pottedplant 1,104,859 43G
sheep 418,983 18G
sofa 2,365,870 56G
train 1,148,020 43G
tvmonitor 2,463,284 46G