|
--- |
|
title: govgis_nov2023-slim-faiss |
|
emoji: π |
|
colorFrom: green |
|
colorTo: blue |
|
sdk: streamlit |
|
sdk_version: 1.29.0 |
|
app_file: app.py |
|
pinned: true |
|
license: mit |
|
--- |
|
|
|
# govgis_nov2023-slim-faiss |
|
|
|
[License: MIT](https://opensource.org/licenses/MIT) |
|
[python](https://www.python.org) |
|
|
|
[Push to HuggingFace Space](https://github.com/joshuasundance-swca/govgis_nov2023-slim-faiss/actions/workflows/hf-space.yml) |
|
[Open HuggingFace Space](https://huggingface.co/spaces/joshuasundance/govgis_nov2023-slim-faiss) |
|
|
|
[pre-commit](https://github.com/pre-commit/pre-commit) |
|
[Ruff](https://github.com/charliermarsh/ruff) |
|
[Checked with mypy](http://mypy-lang.org/) |
|
[Code style: black](https://github.com/psf/black) |
|
|
|
[security: bandit](https://github.com/PyCQA/bandit) |
|
|
|
|
|
|
🤖 This README was written by GPT-4. 🤖 |
|
|
|
## Features |
|
|
|
- **Semantic Search on GIS Metadata**: Searches the `govgis_nov2023` dataset, which describes government ArcGIS servers and their layers. |
|
- **Natural Language Query Processing**: Optionally uses the Claude-Instant and Claude-2.1 models to interpret and rephrase user queries. |
|
- **Advanced Document Retrieval**: Uses a FAISS vector store to retrieve the documents most semantically similar to the query. |
|
- **Customizable User Experience**: Sidebar controls to adjust search parameters and input fields for queries. |
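
To make the retrieval idea concrete, here is a minimal sketch of top-k semantic search. It is not the app's code: it swaps the FAISS vector store for a brute-force cosine-similarity scan in pure Python, and the layer names, the 3-dimensional "embeddings", and the helper names `cosine_similarity` and `top_k` are all made up for illustration.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(
    query_vec: Sequence[float],
    doc_vecs: Sequence[Sequence[float]],
    k: int = 2,
) -> List[Tuple[int, float]]:
    """Return (index, score) pairs for the k most similar documents."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical layer descriptions with made-up toy embeddings.
LAYERS = [
    "Flood hazard zones",
    "County road centerlines",
    "Wetland inventory polygons",
]
EMBEDDINGS = [
    [0.9, 0.1, 0.3],
    [0.1, 0.9, 0.0],
    [0.7, 0.0, 0.6],
]

# Pretend this is the embedding of "areas at risk of flooding".
query = [0.9, 0.0, 0.3]

for idx, score in top_k(query, EMBEDDINGS, k=2):
    print(f"{score:.3f}  {LAYERS[idx]}")
```

In a real deployment an embedding model produces the vectors and a vector index such as FAISS performs the nearest-neighbor lookup; the scan above only shows what "most similar" means for a handful of documents.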
|
|
|
## Dataset Overview |
|
|
|
- **Content**: The app is built around the `govgis_nov2023` dataset, which documents metadata from 1684 government ArcGIS servers, detailing almost a million individual layers. |
|
- **Unique Snapshot**: Captures a point-in-time view of these servers; the metadata includes field information for feature layers and cell size for raster layers. |
|
|
|
## User Interface Guide |
|
|
|
- Adjust search settings like result limits and response generation parameters in the sidebar. |
|
- Securely enter your Anthropic API key for model access. |
|
- Submit natural language queries related to GIS data. |
|
|
|
## Contributions |
|
|
|
We welcome contributions. Please follow the standard fork and pull request process. |
|
|
|
## Support and Contact |
|
|
|
For support, please open an issue on GitHub or in the Hugging Face Space. |
|
|
|
## License |
|
|
|
This project is licensed under the [MIT License](LICENSE.md). |
|
|
|
## Acknowledgments |
|
|
|
Thanks to the Hugging Face and Streamlit communities, and special thanks to Joseph Elfelt and the creators of the `restgdf` library for their contributions to the GIS field. |
|
|
|
|
|
## TODO |
|
- [ ] Add an open source model like `HuggingFaceH4/zephyr-7b-beta` |
|
- [ ] Hybrid search with BM25 or similar |
|
- [ ] Find a lightweight way to incorporate geospatial filtering |
|
- [ ] Add more parameters |
|
|