SupraMNiST-IMG-200k

Sumary

Task: Number-To-Image
Dataset: ylecun/mnist
Total training time: ~8 minutes
Inputs: Number (0-9) 
Outputs: 32x32 image
Params: ~201k
Framework: PyTorch, diffusers
Author: SupraLabs

Description

MNiST-IMG-200k is an ~200k parameter model trained to generate an image based on an input number (0-9).

Architecture

Parameter Value
image_size 32
in_channels 1
out_channels 1
num_classes 10
block_out_channels [12, 16]
layers_per_block 8
norm_num_groups 4

Training

Hardware

MNiST-IMG was trained on Google Colaboratory (NVIDA Tesla T4) for ~8 minutes with a batch size of 64 for 10 epochs.

Dataset

ylecun/mnist

Training Results

Loss ended at ~0.40.

Note: I can't provide the raw training logs as I loss it somehwere after training. Sorry!

Generation Examples

At 1000 decoding steps:

1000 Decoding Step Digit Image Generation

At 200 decoding steps:

200 Decoding Step Generation Image

Inference

Use the script in the repo. inference.py

Related Models

  1. MNIST-IMG-390k

Citation

@misc{mnist-img-390k,
  title     = {MNIST-IMG-390k: a Tiny Diffusion Model for Generating Handwritten Digits},
  author    = {Paul Courneya; Harley-ml},
  year      = {2026},
  url       = {https://huggingface.co/Harley-ml/MNIST-IMG-390k}
}
Downloads last month
28
Safetensors
Model size
201k params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train SupraLabs/SupraMNST-IMG-200k