Spaces:
Running
Running
readme: clickable model badge, sharper title + gradient
Browse files
README.md
CHANGED
|
@@ -1,19 +1,30 @@
|
|
| 1 |
---
|
| 2 |
-
title: Falcon
|
| 3 |
emoji: π¦
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
license: apache-2.0
|
| 9 |
-
short_description: Open-
|
| 10 |
models:
|
| 11 |
- tiiuae/Falcon-Perception
|
| 12 |
- onnx-community/falcon-perception-onnx-webgpu
|
| 13 |
---
|
| 14 |
|
| 15 |
-
#
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: Falcon-Perception-0.6B WebGPU
|
| 3 |
emoji: π¦
|
| 4 |
+
colorFrom: indigo
|
| 5 |
+
colorTo: pink
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
license: apache-2.0
|
| 9 |
+
short_description: Open-vocab detection + segmentation, all in the browser
|
| 10 |
models:
|
| 11 |
- tiiuae/Falcon-Perception
|
| 12 |
- onnx-community/falcon-perception-onnx-webgpu
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# π¦
Falcon-Perception-0.6B WebGPU
|
| 16 |
|
| 17 |
+
A browser demo for **[tiiuae/Falcon-Perception](https://huggingface.co/tiiuae/Falcon-Perception)** β a 0.6B open-vocabulary VLM that turns natural-language queries into bounding boxes and pixel-accurate segmentation masks, running fully client-side via WebGPU + ONNX Runtime Web.
|
| 18 |
|
| 19 |
+
[](https://huggingface.co/tiiuae/Falcon-Perception)
|
| 20 |
+
[](https://huggingface.co/onnx-community/falcon-perception-onnx-webgpu)
|
| 21 |
+
|
| 22 |
+
## What's inside
|
| 23 |
+
|
| 24 |
+
- **Detection** β draw bounding boxes for any natural-language query ("athletes", "the runner in front", "mangoes").
|
| 25 |
+
- **Segmentation** β pixel-accurate masks via the AnyUp upsampler, all in-browser.
|
| 26 |
+
- **Tracker (preview)** β HUD-style reticles on video. Limited by VLM latency between detections; see the in-space disclaimer.
|
| 27 |
+
|
| 28 |
+
## How it runs
|
| 29 |
+
|
| 30 |
+
2.4 GB of ONNX weights are fetched once on first visit, then cached by your browser β no backend, no API keys, no network round-trip after load. Multi-threaded WASM is enabled via `coi-serviceworker`.
|