Text-to-image finetuning - kerianheYi/CS245-fine-tunedSD5200_5600_14122

This pipeline was finetuned from kerianheyi/CS245-fine-tunedSD4800_5200_14122 on the jytjyt05/t_to_m7 dataset. Below are some example images generated with the finetuned pipeline using the following prompts: ['A melSpectrogram for piano solo in Major']:

val_imgs_grid

Pipeline usage

You can use the pipeline like so:

from diffusers import DiffusionPipeline
import torch

pipeline = DiffusionPipeline.from_pretrained("kerianheYi/CS245-fine-tunedSD5200_5600_14122", torch_dtype=torch.float16)
prompt = "A melSpectrogram for piano solo in Major"
image = pipeline(prompt).images[0]
image.save("my_image.png")

Training info

These are the key hyperparameters used during training:

  • Epochs: 1
  • Learning rate: 1e-05
  • Batch size: 1
  • Gradient accumulation steps: 4
  • Image resolution: 512
  • Mixed-precision: fp16
Downloads last month
0
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train kerianheYi/CS245-fine-tunedSD5200_5600_14122