Edit model card

pixart-sora-t2i

This is a full rank finetune derived from toilaluan/SoraT2I.

No validation prompt was used during training.

None

Validation settings

  • CFG: 7.5
  • CFG Rescale: 0.0
  • Steps: 30
  • Sampler: euler
  • Seed: 42
  • Resolution: 1024

Note: The validation settings are not necessarily the same as the training settings.

You can find some example images in the following gallery:

Prompt
unconditional (blank prompt)
Negative Prompt
blurry, cropped, ugly
Prompt
a woman sitting on the grass
Negative Prompt
blurry, cropped, ugly
Prompt
a professional photo headshot of a man in studio lighting
Negative Prompt
blurry, cropped, ugly
Prompt
a person holding a sign that reads 'SOON'
Negative Prompt
blurry, cropped, ugly
Prompt
Alien marketplace, bizarre creatures, exotic goods, vibrant colors, otherworldly atmosphere
Negative Prompt
blurry, cropped, ugly
Prompt
Child holding a balloon, happy expression, colorful balloons, sunny day, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
a 4-panel comic strip showing an orange cat saying the words 'HELP' and 'LASAGNA'
Negative Prompt
blurry, cropped, ugly
Prompt
a hand is holding a comic book with a cover that reads 'The Adventures of Superhero'
Negative Prompt
blurry, cropped, ugly
Prompt
Underground cave filled with crystals, glowing lights, reflective surfaces, fantasy environment, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Bustling cyberpunk bazaar, vendors, neon signs, advanced tech, crowded, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Cyberpunk hacker in a dark room, neon glow, multiple screens, intense focus, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
a cybernetic anne of green gables with neural implant and bio mech augmentations
Negative Prompt
blurry, cropped, ugly
Prompt
Post-apocalyptic cityscape, ruined buildings, overgrown vegetation, dark and gritty, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Magical castle in a lush forest, glowing windows, fantasy architecture, high resolution, detailed textures
Negative Prompt
blurry, cropped, ugly
Prompt
Ruins of an ancient temple in an enchanted forest, glowing runes, mystical creatures, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Mystical forest, glowing plants, fairies, magical creatures, fantasy art, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Magical garden with glowing flowers, fairies, serene atmosphere, detailed plants, high resolution
Negative Prompt
blurry, cropped, ugly
Prompt
Whimsical garden filled with fairies, magical plants, sparkling lights, serene atmosphere, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Majestic dragon soaring through the sky, detailed scales, dynamic pose, fantasy art, high resolution
Negative Prompt
blurry, cropped, ugly
Prompt
Fantasy world, floating islands in the sky, waterfalls, lush vegetation, detailed landscape, high resolution
Negative Prompt
blurry, cropped, ugly
Prompt
Futuristic city skyline at night, neon lights, cyberpunk style, high contrast, sharp focus
Negative Prompt
blurry, cropped, ugly
Prompt
Space battle scene, starships fighting, laser beams, explosions, cosmic background
Negative Prompt
blurry, cropped, ugly
Prompt
Abandoned fairground at night, eerie rides, ghostly figures, fog, dark atmosphere, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Spooky haunted mansion on a hill, dark and eerie, glowing windows, ghostly atmosphere, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
a hardcover physics textbook that is called PHYSICS FOR DUMMIES
Negative Prompt
blurry, cropped, ugly
Prompt
Epic medieval battle, knights in armor, dynamic action, detailed landscape, high resolution
Negative Prompt
blurry, cropped, ugly
Prompt
Bustling medieval market with merchants, knights, and jesters, vibrant colors, detailed
Negative Prompt
blurry, cropped, ugly
Prompt
Cozy medieval tavern, warm firelight, adventurers drinking, detailed interior, rustic atmosphere
Negative Prompt
blurry, cropped, ugly
Prompt
Futuristic city skyline at night, neon lights, cyberpunk style, high contrast, sharp focus
Negative Prompt
blurry, cropped, ugly
Prompt
Forest with neon-lit trees, glowing plants, bioluminescence, surreal atmosphere, high detail
Negative Prompt
blurry, cropped, ugly
Prompt
Bright neon sign in a busy city street, 'Open 24 Hours', bold typography, glowing lights
Negative Prompt
blurry, cropped, ugly
Prompt
Retro diner sign, 'Joe's Diner', classic 1950s design, neon lights, weathered look
Negative Prompt
blurry, cropped, ugly
Prompt
Vintage store sign with elaborate typography, 'Antique Shop', hand-painted, weathered look
Negative Prompt
blurry, cropped, ugly

The text encoder was not trained. You may reuse the base model text encoder for inference.

Training settings

  • Training epochs: 0
  • Training steps: 1000
  • Learning rate: 8e-06
  • Effective batch size: 128
    • Micro-batch size: 32
    • Gradient accumulation steps: 4
    • Number of GPUs: 1
  • Prediction type: epsilon
  • Rescaled betas zero SNR: False
  • Optimizer: AdamW, stochastic bf16
  • Precision: Pure BF16
  • Xformers: Enabled

Datasets

mj-v6

  • Repeats: 0
  • Total number of images: 134144
  • Total number of aspect buckets: 1
  • Resolution: 1.0 megapixels
  • Cropped: False
  • Crop style: None
  • Crop aspect: None

Inference

import torch
from diffusers import DiffusionPipeline



model_id = "pixart-sora-t2i"
prompt = "An astronaut is riding a horse through the jungles of Thailand."
negative_prompt = "malformed, disgusting, overexposed, washed-out"

pipeline = DiffusionPipeline.from_pretrained(model_id)
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
image = pipeline(
    prompt=prompt,
    negative_prompt='blurry, cropped, ugly',
    num_inference_steps=30,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
    width=1152,
    height=768,
    guidance_scale=7.5,
    guidance_rescale=0.0,
).images[0]
image.save("output.png", format="PNG")
Downloads last month
3
Inference Examples
Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for toilaluan/pixart-sora-t2i

Base model

toilaluan/SoraT2I
Finetuned
this model