Spaces: algoryn/dots-ocr-idcard

tommulder committed on Sep 9

Commit f256ddd

1 Parent(s): 0a7e5ec

Add comprehensive .gitignore and merge README files for HF Spaces deployment

- Add comprehensive .gitignore with Python, ML/AI, and HF Spaces specific rules
- Merge README_HF_Deploy.md into main README.md
- Enhance README with HF Spaces best practices and deployment instructions
- Add emojis, better organization, and professional documentation structure
- Include step-by-step deployment guide and API usage examples

Files changed (3)

.gitignore +256 -0
README.md +143 -22
README_HF_Deploy.md +0 -66

.gitignore ADDED

@@ -0,0 +1,256 @@

+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+# C extensions
+*.so
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+pip-wheel-metadata/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+# Translations
+*.mo
+*.pot
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+# Flask stuff:
+instance/
+.webassets-cache
+# Scrapy stuff:
+.scrapy
+# Sphinx documentation
+docs/_build/
+# PyBuilder
+target/
+# Jupyter Notebook
+.ipynb_checkpoints
+# IPython
+profile_default/
+ipython_config.py
+# pyenv
+.python-version
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow
+__pypackages__/
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+# SageMath parsed files
+*.sage.py
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+# Spyder project settings
+.spyderproject
+.spyproject
+# Rope project settings
+.ropeproject
+# mkdocs documentation
+/site
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+# Pyre type checker
+.pyre/
+# ML/AI specific
+# Model files and checkpoints
+*.pth
+*.pt
+*.pkl
+*.pickle
+*.h5
+*.hdf5
+*.joblib
+*.model
+*.ckpt
+*.safetensors
+# Data files
+*.csv
+*.json
+*.jsonl
+*.parquet
+*.feather
+*.arrow
+data/
+datasets/
+raw_data/
+processed_data/
+# Hugging Face specific
+.cache/
+huggingface_hub/
+transformers_cache/
+# OpenCV and image processing
+*.jpg
+*.jpeg
+*.png
+*.gif
+*.bmp
+*.tiff
+*.tif
+*.webp
+*.svg
+test_images/
+sample_images/
+uploads/
+temp_images/
+# IDE and editor files
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+.DS_Store
+Thumbs.db
+# OS generated files
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+# Logs
+*.log
+logs/
+log/
+# Temporary files
+tmp/
+temp/
+.tmp/
+# Docker
+.dockerignore
+# Local configuration
+config.local.py
+settings.local.py
+.env.local
+.env.development
+.env.test
+.env.production
+# Backup files
+*.bak
+*.backup
+*.old
+# Runtime files
+*.pid
+*.sock
+# Coverage reports
+htmlcov/
+.coverage
+coverage.xml
+# Profiling
+*.prof
+# Jupyter notebook checkpoints
+.ipynb_checkpoints/
+# pytest
+.pytest_cache/
+# Ruff
+.ruff_cache/
+# Black
+.black/
+# isort
+.isort.cfg
+# Pre-commit
+.pre-commit-config.yaml
+# Local development
+local/
+dev/
+development/
+.cursor/
+docs/
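
The added `.gitignore` lists several patterns more than once (for example `.DS_Store`, `Thumbs.db`, `*.log`, and `htmlcov/`). Duplicates are harmless to Git but make the file harder to maintain. A minimal sketch of a checker follows; the function name `duplicate_patterns` is hypothetical, and the comparison is purely textual, so it does not recognise patterns that are equivalent but spelled differently (such as `.ipynb_checkpoints` and `.ipynb_checkpoints/`).

```python
from collections import Counter


def duplicate_patterns(gitignore_text: str) -> list[str]:
    """Return patterns that occur more than once, in first-seen order.

    Hypothetical helper: compares lines textually after stripping
    surrounding whitespace and skipping blank lines and comments.
    """
    patterns = []
    for line in gitignore_text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            patterns.append(stripped)

    counts = Counter(patterns)
    duplicates = []
    for pattern in patterns:
        if counts[pattern] > 1 and pattern not in duplicates:
            duplicates.append(pattern)
    return duplicates


# A short excerpt modelled on the sections added in this commit.
sample = """\
# IDE and editor files
.DS_Store
Thumbs.db
# OS generated files
.DS_Store
Thumbs.db
# Logs
*.log
"""

print(duplicate_patterns(sample))
```

To check the real file, pass `open(".gitignore").read()` instead of `sample`.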

README.md CHANGED

@@ -12,25 +12,47 @@ license: "private"
 # KYB Dots.OCR Text Extraction
-This Hugging Face Space provides a FastAPI endpoint for text extraction from identity documents using Dots.OCR with ROI (Region of Interest) support.
-## Features
-- **Text Extraction**: Extract text from identity documents using Dots.OCR
-- **ROI Support**: Process pre-cropped images or full images with ROI coordinates
-- **Field Mapping**: Structured field extraction with confidence scores
-- **MRZ Detection**: Machine Readable Zone data extraction
-- **Standardized API**: Consistent response format for integration
-## API Endpoints
-### Health Check
 ```
 GET /health
 ```
 ### Text Extraction
-```
 POST /v1/id/ocr
 Content-Type: multipart/form-data
@@ -38,7 +60,13 @@ file: <image_file>
 roi: {"x1": 0.0, "y1": 0.0, "x2": 1.0, "y2": 1.0} (optional)
 ```
-## Response Format
 ```json
 {
@@ -80,26 +108,119 @@ roi: {"x1": 0.0, "y1": 0.0, "x2": 1.0, "y2": 1.0} (optional)
 }
 ```
-## Usage
-1. Upload an image file (JPEG, PNG)
-2. Optionally provide ROI coordinates as JSON string
-3. Receive structured field extraction results
-## Environment Variables
 - `HF_DOTS_MODEL_PATH`: Path to Dots.OCR model weights
 - `HF_DOTS_CONFIDENCE_THRESHOLD`: Confidence threshold for field extraction
 - `HF_DOTS_DEVICE`: Device to use (auto, cpu, cuda)
 - `HF_DOTS_MAX_IMAGE_SIZE`: Maximum image size for processing
 - `HF_DOTS_MRZ_ENABLED`: Enable MRZ detection
-## Performance
-- **GPU**: 300-900ms processing time
-- **CPU**: 3-8s processing time
-- **Memory**: ~6GB per instance
-## Privacy
-This endpoint processes images temporarily and does not store or log personal information. All field values are redacted in logs for privacy protection.

 # KYB Dots.OCR Text Extraction
+This [Hugging Face Space](https://huggingface.co/docs/hub/spaces) provides a FastAPI endpoint for text extraction from identity documents using Dots.OCR with ROI (Region of Interest) support. Built as a Docker Space for maximum flexibility and performance.
+## 🚀 Quick Start
+### Using the API
+1. **Upload an image** (JPEG, PNG, or other supported formats)
+2. **Optionally specify ROI** coordinates for targeted extraction
+3. **Get structured results** with confidence scores and field mapping
+### Test the API
+```bash
+# Basic OCR test
+curl -X POST https://algoryn-dots-ocr-idcard.hf.space/v1/id/ocr \
+  -F "file=@test_image.jpg"
+# With ROI (region of interest)
+curl -X POST https://algoryn-dots-ocr-idcard.hf.space/v1/id/ocr \
+  -F "file=@test_image.jpg" \
+  -F 'roi={"x1":0.1,"y1":0.1,"x2":0.9,"y2":0.9}'
 ```
+## ✨ Features
+- **🔍 Text Extraction**: Extract text from identity documents using Dots.OCR
+- **📐 ROI Support**: Process pre-cropped images or full images with ROI coordinates
+- **📋 Field Mapping**: Structured field extraction with confidence scores
+- **🆔 MRZ Detection**: Machine Readable Zone data extraction
+- **🔌 Standardized API**: Consistent response format for integration
+- **🐳 Docker-based**: Full control over dependencies and environment
+- **⚡ GPU Support**: Optimized for Hugging Face Spaces GPU instances
+## 📡 API Endpoints
+### Health Check
+```http
 GET /health
 ```
+Returns service status and version information.
 ### Text Extraction
+```http
 POST /v1/id/ocr
 Content-Type: multipart/form-data
 roi: {"x1": 0.0, "y1": 0.0, "x2": 1.0, "y2": 1.0} (optional)
 ```
+**Parameters:**
+- `file`: Image file to process (required)
+- `roi`: JSON string with normalized coordinates (optional)
+  - `x1`, `y1`: Top-left corner (0.0 to 1.0)
+  - `x2`, `y2`: Bottom-right corner (0.0 to 1.0)
+## 📄 Response Format
 ```json
 {
 }
 ```
+## 🛠️ Deployment to Hugging Face Spaces
+### Prerequisites
+- [Hugging Face CLI](https://huggingface.co/docs/hub/install-huggingface-cli) installed
+- Docker installed locally (for testing)
+### 1. Create HF Space
+```bash
+# Login to Hugging Face
+huggingface-cli login
+# Create a new Docker Space
+huggingface-cli repo create dots-ocr-idcard --type space --space_sdk docker --organization algoryn
+```
+### 2. Clone and Setup
+```bash
+# Clone the space locally
+git clone https://huggingface.co/spaces/algoryn/dots-ocr-idcard
+cd dots-ocr-idcard
+# Copy required files
+cp /path/to/kybtech-ml-pipelines/docker/hf/dots-ocr/* .
+# Copy field extraction module
+mkdir -p src/idcard_api
+cp /path/to/kybtech-ml-pipelines/src/idcard_api/field_extraction.py src/idcard_api/
+touch src/idcard_api/__init__.py
+```
+### 3. Deploy
+```bash
+git add .
+git commit -m "Deploy Dots-OCR text extraction service"
+git push
+```
+### 4. Test Deployment
+The Space will be available at `https://algoryn-dots-ocr-idcard.hf.space` after deployment (usually 5-10 minutes).
+## ⚙️ Configuration
+### Environment Variables
 - `HF_DOTS_MODEL_PATH`: Path to Dots.OCR model weights
 - `HF_DOTS_CONFIDENCE_THRESHOLD`: Confidence threshold for field extraction
 - `HF_DOTS_DEVICE`: Device to use (auto, cpu, cuda)
 - `HF_DOTS_MAX_IMAGE_SIZE`: Maximum image size for processing
 - `HF_DOTS_MRZ_ENABLED`: Enable MRZ detection
+### Hugging Face Spaces Settings
+- **SDK**: Docker
+- **Port**: 7860 (default)
+- **Hardware**: CPU (upgradeable to GPU)
+- **Storage**: Persistent storage available for model caching
+## 📊 Performance
+| Hardware | Processing Time | Memory Usage |
+|----------|----------------|--------------|
+| **GPU** | 300-900ms | ~6GB |
+| **CPU** | 3-8s | ~2GB |
+## 🔒 Privacy & Security
+- **No Data Storage**: Images are processed temporarily and not stored
+- **Privacy Protection**: All field values are redacted in logs
+- **Secure Processing**: Runs in isolated Docker containers
+- **No Tracking**: No user data or usage analytics collected
+## 🐳 Local Development
+### Run with Docker
+```bash
+# Build the image
+docker build -t dots-ocr-api .
+# Run the container
+docker run -p 7860:7860 dots-ocr-api
+```
+### Run with Python
+```bash
+# Install dependencies
+pip install -r requirements.txt
+# Run the application
+python app.py
+```
+## 📚 Documentation
+- [Hugging Face Spaces Documentation](https://huggingface.co/docs/hub/spaces)
+- [Docker Spaces Guide](https://huggingface.co/docs/hub/spaces-sdks-docker)
+- [FastAPI Documentation](https://fastapi.tiangolo.com/)
+## 🤝 Contributing
+1. Fork the repository
+2. Create a feature branch
+3. Make your changes
+4. Test thoroughly
+5. Submit a pull request
+## 📄 License
+This project is licensed under a private license. See the license file for details.
+## 🆘 Support
+- **Issues**: Report bugs and request features via GitHub Issues
+- **Discussions**: Join the community discussions
+- **Email**: Contact us at website@huggingface.co for advanced support
+---
+Built with ❤️ using [Hugging Face Spaces](https://huggingface.co/docs/hub/spaces) and FastAPI
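
The README describes `roi` as a JSON string of normalized coordinates, with `x1`, `y1` as the top-left corner and `x2`, `y2` as the bottom-right corner, each between 0.0 and 1.0. The sketch below shows one way a client might build and sanity-check that string and map it onto pixel coordinates. The helper names `build_roi` and `roi_to_pixels` are hypothetical, and the rule that `x1 < x2` and `y1 < y2` is an assumption; the diff does not say how the service handles inverted or empty regions.

```python
import json


def build_roi(x1: float, y1: float, x2: float, y2: float) -> str:
    """Serialize a normalized ROI as the JSON string used in the `roi` form field."""
    for name, value in (("x1", x1), ("y1", y1), ("x2", x2), ("y2", y2)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be within [0.0, 1.0], got {value}")
    # Assumption: a region with zero or negative area is not meaningful.
    if x1 >= x2 or y1 >= y2:
        raise ValueError("expected x1 < x2 and y1 < y2")
    return json.dumps({"x1": x1, "y1": y1, "x2": x2, "y2": y2}, separators=(",", ":"))


def roi_to_pixels(roi_json: str, width: int, height: int) -> tuple[int, int, int, int]:
    """Map a normalized ROI onto an image of the given size as (left, top, right, bottom)."""
    roi = json.loads(roi_json)
    return (
        round(roi["x1"] * width),
        round(roi["y1"] * height),
        round(roi["x2"] * width),
        round(roi["y2"] * height),
    )


roi = build_roi(0.1, 0.1, 0.9, 0.9)
print(roi)
print(roi_to_pixels(roi, width=1000, height=600))
```

The string returned by `build_roi(0.1, 0.1, 0.9, 0.9)` matches the value passed to `-F 'roi=...'` in the README's curl example.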

README_HF_Deploy.md DELETED

@@ -1,66 +0,0 @@
-# Dots-OCR Text Extraction Service - HF Deployment
-## Quick Deploy to Hugging Face Spaces
-### 1. Create HF Space
-```bash
-# Login to Hugging Face
-huggingface-cli login
-# Create a new Space
-huggingface-cli repo create dots-ocr-idcard --type space --space_sdk docker --organization algoryn
-```
-### 2. Prepare for Deployment
-```bash
-# Clone the space locally
-git clone https://huggingface.co/spaces/algoryn/dots-ocr-idcard
-cd dots-ocr-idcard
-# Copy required files
-cp /path/to/kybtech-ml-pipelines/docker/hf/dots-ocr/* .
-# Copy field extraction module (needed for structured extraction)
-mkdir -p src/idcard_api
-cp /path/to/kybtech-ml-pipelines/src/idcard_api/field_extraction.py src/idcard_api/
-touch src/idcard_api/__init__.py
-```
-### 3. Create app.py if needed
-The app.py file should already exist. Key features:
-- `/health` endpoint for health checks
-- `/v1/id/ocr` endpoint for text extraction
-- Supports ROI (Region of Interest) cropping
-- Structured field extraction
-- MRZ extraction support
-### 4. Push to HF
-```bash
-git add .
-git commit -m "Deploy Dots-OCR text extraction service"
-git push
-```
-### 5. Test the Deployment
-Once deployed (usually takes 5-10 minutes), test with:
-```bash
-# Basic OCR test
-curl -X POST https://algoryn-dots-ocr-idcard.hf.space/v1/id/ocr \
-  -H "Authorization: Bearer YOUR_HF_TOKEN" \
-  -F "file=@test_image.jpg"
-# With ROI (region of interest)
-curl -X POST https://algoryn-dots-ocr-idcard.hf.space/v1/id/ocr \
-  -H "Authorization: Bearer YOUR_HF_TOKEN" \
-  -F "file=@test_image.jpg" \
-  -F 'roi={"x1":0.1,"y1":0.1,"x2":0.9,"y2":0.9}'
-```
-## Environment Variables
-No special environment variables needed. The service runs on port 7860 by default.
-## Notes
-- Service includes mock mode if Dots-OCR fails to load
-- Health check available at `/health`
-- Structured field extraction included
-- MRZ parsing support built-in
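
The curl examples in the deleted deploy guide sent an `Authorization: Bearer YOUR_HF_TOKEN` header, while the examples merged into `README.md` omit it. Whether a token is needed presumably depends on the Space's visibility, which this diff does not state. The sketch below builds the same multipart request in Python with the token made optional. It uses only the standard library and does not send anything; `build_ocr_request` is a hypothetical helper, and the form field names `file` and `roi` are taken from the README.

```python
import mimetypes
import urllib.request
import uuid
from typing import Optional


def build_ocr_request(
    url: str,
    image_bytes: bytes,
    filename: str,
    roi_json: Optional[str] = None,
    token: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST for /v1/id/ocr."""
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"

    # File part: headers, a blank line, the raw bytes, then CRLF.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    )
    body = bytearray(file_header.encode("utf-8"))
    body += image_bytes + b"\r\n"

    # Optional ROI part, sent as a plain form field holding a JSON string.
    if roi_json is not None:
        roi_part = (
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="roi"\r\n\r\n'
            f"{roi_json}\r\n"
        )
        body += roi_part.encode("utf-8")

    body += f"--{boundary}--\r\n".encode("utf-8")

    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    if token:
        # Only attached when a token is supplied.
        headers["Authorization"] = f"Bearer {token}"

    return urllib.request.Request(url, data=bytes(body), headers=headers, method="POST")


req = build_ocr_request(
    "https://algoryn-dots-ocr-idcard.hf.space/v1/id/ocr",
    image_bytes=b"placeholder-bytes",  # stand-in for real image content
    filename="test_image.jpg",
    roi_json='{"x1":0.1,"y1":0.1,"x2":0.9,"y2":0.9}',
    token="YOUR_HF_TOKEN",
)
print(req.get_method(), req.full_url)
print(req.has_header("Authorization"))
```

Passing the result to `urllib.request.urlopen(req)` would send the request; that step is left out here so the example runs without network access.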