Add model card for Sa2va-Instance-4B (Stage 1 of InstanceControl)

#1
by nielsr HF Staff - opened

This PR adds a comprehensive model card for Sa2va-Instance-4B, which is the Stage 1 model of the InstanceControl framework presented in the ECCV 2026 paper InstanceControl: Controllable Complex Image Generation without Instance Labeling.

The model card includes:

  • Metadata specifying the image-text-to-text pipeline tag and the transformers library name.
  • References and links to the paper, project page, GitHub repository, and dataset/benchmark.
  • Installation instructions and usage examples for performing Stage 1 inference to predict instance masks.
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment