Spaces:

big-vision
/

paligemma-hf

Running

Different results between Jax Space and the HF Transformers Space

by Shalev - opened May 20, 2024

May 20, 2024

From https://huggingface.co/spaces/big-vision/paligemma - the Jax model works well.

But the https://huggingface.co/spaces/big-vision/paligemma-hf space just selects the entire image (on the same input). I'm trying to reproduce the (better) Jax behavior on HF transformers, but I can't figure out what's being done differently on the Jax side. Any tips would be appreciated!

codelion

May 23, 2024

Seeing similar issues, is there a difference in the HF version?

merve

May 23, 2024

@Shalev @codelion I will debug and come back to you on this

D-Anel

Jun 13, 2024

Hi, how can we decode the segmentation tokens into binary mask for object segmentation?

codelion

Jun 13, 2024

@D-Anel you can check the code here - https://huggingface.co/spaces/big-vision/paligemma-hf/blob/main/app.py#L43

D-Anel

Jun 13, 2024

@codelion Thank you

D-Anel

Jun 13, 2024

@merve Did you find any solution on why the HF version does not perform? I am having the same issue as @Shalev but in segmentation. It would return a mas of zeros in HF version while works pretty well on jax.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment