prithivMLmods's picture
Update README.md
f304c1b verified
metadata
tags:
  - text-to-image
  - lora
  - diffusers
  - template:diffusion-lora
widget:
  - text: >-
      sketch card, a close-up of a hand holding a card with a cartoon image of
      Mario on it. The card has a yellow background with a red cap and a red M
      on it, and the character is wearing blue overalls with a yellow button on
      the left side of his chest. The character is waving his left hand and has
      a big smile on his face. To the right of the card is a small cartoon
      character with a blue outfit and red hat. They are standing on a table
      with a white tablecloth. The table is adorned with small lights, adding a
      pop of color to the scene.
    output:
      url: images/SC1.png
  - text: >-
      sketch card, a hand is holding a small card with a drawing of three bears
      on it. The first bear is a panda, the second is a brown bear, and the
      third is a white bear. The bear on the left is wearing a gray and white
      striped shirt, while the third bear is in the middle of the three bears.
      The bears are facing each other, with their mouth open. The third bear has
      its head tilted to the left. The background is a gray wall with a row of
      windows in the upper left corner of the frame.
    output:
      url: images/SC2.png
  - text: >-
      sketch card, a hand is holding a small, square, white paper with a cartoon
      image of a yellow minion on it. The minion faces are drawn in a
      cartoon-like fashion, with big, round eyes, a wide smile, and a pair of
      eye-level glasses. The background of the image is a light blue, with Asian
      characters in a foreign language. To the right of the minions face, there
      is a white wall with multi-colored squares on it, adding a pop of color to
      the scene.
    output:
      url: images/SC3.png
  - text: >-
      sketch card, a hand is holding a white card with a cartoon drawing of a
      man in a gray jacket and a green shirt. The man has long black hair and a
      white face mask. His right hand is raised in the air, while his left hand
      is resting on his hip. The drawing is done in a simple, cartoon style. The
      background of the card is a collage of other cartoon drawings. To the
      right of the cards is a row of colored paints.
    output:
      url: images/SC4.png
base_model: black-forest-labs/FLUX.1-dev
instance_prompt: sketch card
license: creativeml-openrail-m

Flux.1-Dev-Sketch-Card-LoRA

Prompt
sketch card, a close-up of a hand holding a card with a cartoon image of Mario on it. The card has a yellow background with a red cap and a red M on it, and the character is wearing blue overalls with a yellow button on the left side of his chest. The character is waving his left hand and has a big smile on his face. To the right of the card is a small cartoon character with a blue outfit and red hat. They are standing on a table with a white tablecloth. The table is adorned with small lights, adding a pop of color to the scene.
Prompt
sketch card, a hand is holding a small card with a drawing of three bears on it. The first bear is a panda, the second is a brown bear, and the third is a white bear. The bear on the left is wearing a gray and white striped shirt, while the third bear is in the middle of the three bears. The bears are facing each other, with their mouth open. The third bear has its head tilted to the left. The background is a gray wall with a row of windows in the upper left corner of the frame.
Prompt
sketch card, a hand is holding a small, square, white paper with a cartoon image of a yellow minion on it. The minion faces are drawn in a cartoon-like fashion, with big, round eyes, a wide smile, and a pair of eye-level glasses. The background of the image is a light blue, with Asian characters in a foreign language. To the right of the minions face, there is a white wall with multi-colored squares on it, adding a pop of color to the scene.
Prompt
sketch card, a hand is holding a white card with a cartoon drawing of a man in a gray jacket and a green shirt. The man has long black hair and a white face mask. His right hand is raised in the air, while his left hand is resting on his hip. The drawing is done in a simple, cartoon style. The background of the card is a collage of other cartoon drawings. To the right of the cards is a row of colored paints.

The model is still in the training phase. This is not the final version and may contain artifacts and perform poorly in some cases.

Model description

prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA

Image Processing Parameters

Parameter Value Parameter Value
LR Scheduler constant Noise Offset 0.03
Optimizer AdamW Multires Noise Discount 0.1
Network Dim 64 Multires Noise Iterations 10
Network Alpha 32 Repeat & Steps 14 & 1990
Epoch 16 Save Every N Epochs 1
Labeling: florence2-en(natural language & English)

Total Images Used for Training : 13

Best Dimensions

  • 768 x 1024 (Best)
  • 1024 x 1024 (Default)

Setting Up

import torch
from pipelines import DiffusionPipeline

base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)

lora_repo = "prithivMLmods/Flux.1-Dev-Sketch-Card-LoRA"
trigger_word = "sketch card"  
pipe.load_lora_weights(lora_repo)

device = torch.device("cuda")
pipe.to(device)

Trigger words

You should use sketch card to trigger the image generation.

Download model

Weights for this model are available in Safetensors format.

Download them in the Files & versions tab.