tonyyang2000
/

EdgeSAM

JingShiang Yang commited on Nov 14, 2025

Commit

2e11ff8

1 Parent(s): d5f470e

Fix mask output: use decoder output[1] and resize to 1024x1024

Files changed (1) hide show

handler.py CHANGED Viewed

@@ -54,12 +54,14 @@ class EndpointHandler:
                 'point_labels': labels.reshape(1, -1)
             })
-            masks = decoder_outputs[0]
-            # Postprocess - squeeze to get 2D mask
-            mask = masks.squeeze()  # Remove all dimensions of size 1
-            if len(mask.shape) > 2:
-                mask = mask[0]  # Take first mask if multiple
             mask = (mask > 0.0).astype(np.uint8) * 255
             # Return result

                 'point_labels': labels.reshape(1, -1)
             })
+            # decoder_outputs[0] is IoU scores (1, 4)
+            # decoder_outputs[1] is masks (1, 4, 256, 256)
+            masks = decoder_outputs[1]
+            # Take first mask and resize to 1024x1024
+            mask = masks[0, 0]  # Shape: (256, 256)
+            mask = Image.fromarray(mask).resize((1024, 1024), Image.BILINEAR)
+            mask = np.array(mask)
             mask = (mask > 0.0).astype(np.uint8) * 255
             # Return result