Edit model card

controlnet- JFoz/dog-cat-pose

Simple controlnet model made as part of the HF JaX/Diffusers community sprint.

These are controlnet weights trained on runwayml/stable-diffusion-v1-5 with pose conditioning generated using the animalpose model of OpenPifPaf.

Some example images can be found in the following

prompt: a tortoiseshell cat is sitting on a cushion images_0) prompt: a yellow dog standing on a lawn images_1)

Whilst not the dataset used for this model, a smaller dataset with the same format for conditioning images can be found at https://huggingface.co/datasets/JFoz/dog-poses-controlnet-dataset

The dataset was generated using the code at https://github.com/jfozard/animalpose/tree/f1be80ed29886a1314054b87f2a8944ea98997ac

Model Card for dog-cat-pose

This is an ControlNet model which allows users to control the pose of a dog or cat. Poses were extracted from images using the animalpose model of OpenPifPaf https://openpifpaf.github.io/intro.html . Skeleton colouring is as shown in the dataset. See also https://huggingface.co/JFoz/dog-pose

Model Details

Model Description

This is an ControlNet model which allows users to control the pose of a dog or cat. Poses were extracted from images using the animalpose model of OpenPifPaf https://openpifpaf.github.io/intro.html. Skeleton colouring is as shown in the dataset. See also https://huggingface.co/JFoz/dog-pose

Uses

Direct Use

Supply a suitable, potentially incomplete pose along with a relevant text prompt

Out-of-Scope Use

Generating images of non-animals. We advise retaining the stable diffusion safety filter when using this model.

Bias, Risks, and Limitations

The model is trained on a relatively small dataset, and may be overfit to those images.

Recommendations

Maintain careful supervision of model inputs and outputs.

Training Details

Training Data

Trained on a subset of Laion-5B using clip retrieval with the prompts "a photo of a (dog/cat) (standing/walking)"

Training Procedure

Preprocessing

Images were rescaled to 512 along their short edge and centrally cropped. The OpenPifPaf pose-detection model was used to extract poses, which were used to generate conditioning images.

Compute Infrastructure

TPUv4i

Software

Flax stable diffusion controlnet pipeline

Model Card Authors [optional]

John Fozard

Downloads last month
32
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Adapter for

Spaces using JFoz/dog-cat-pose 4