ποΈ BLIP Image Captioning for Municipality Use (Graduation Project)
This is a fine-tuned version of Salesforce's BLIP base model, customized for generating captions on images related to street infrastructure and urban environments.
The model was developed as part of a Graduation Project at the College of Information Systems.
π Graduation Project Information
- Project Title: Smart Captioning for Urban Monitoring Using AI
- Student Name: Saja
- University Major: Information Systems
- Semester: Final Year β Graduation Project 1
- Purpose: This model aims to assist municipalities in automatically generating descriptive captions for street and infrastructure images using AI.
π§ Model Overview
- Base Model: Salesforce/blip-image-captioning-base
- Architecture: BLIP (Bootstrapped Language Image Pretraining)
- Task: Image Captioning (with a focus on municipality-related data)
ποΈ Dataset
This model was fine-tuned on a custom dataset consisting of images captured in urban environments (e.g., roads, signs, sidewalks) for the purpose of city infrastructure monitoring.
β οΈ The dataset is not publicly released due to privacy and data ownership considerations.
β Intended Use
This model is designed for:
- Generating captions for street-level and city infrastructure images.
- Assisting municipalities in monitoring, analyzing, and documenting visual data.
β Limitations
- May not generalize well to domains outside of urban/street imagery.
- Captions might be biased by the dataset or lack context in unfamiliar scenes.
- Does not include object detection β focuses on captioning only.
π License
- This model is released under the MIT License.
- It is based on the Salesforce BLIP model, which is licensed under the BSD-3-Clause License.
- Please ensure appropriate credit is given to the original authors when using or redistributing this model.
π€ Acknowledgements
- Special thanks to Salesforce Research for the original BLIP model.
- Developed as part of a university graduation project with guidance from faculty members.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support
HF Inference deployability: The model has no library tag.
Model tree for saja003/MuniVis
Base model
Salesforce/blip-image-captioning-base