Update README.md
README.md
CHANGED
@@ -9,6 +9,8 @@ pinned: false
 
 # ASID-Caption
 
+[[🏠 Homepage](https://asid-caption.github.io/)] [[📖 Arxiv Paper](https://arxiv.org/pdf/2602.13013)] [[🤗 Models & Datasets](https://huggingface.co/AudioVisual-Caption)] [[💻 Code](https://github.com/HVision-NKU/ASID-Caption)]
+
 We build **ASID-Caption**, a data-and-model suite for **fine-grained audiovisual video understanding**.
 
 Our goal is to move beyond “one video → one generic caption” by providing **attribute-structured supervision** and **quality-verified annotations**, enabling models to produce **more complete, more controllable, and more temporally consistent** descriptions that cover both **visual content** and **audio cues**.