
# Hugging Face Spaces Deployment Guide

## Steps to Deploy on Hugging Face Spaces

### 1. Create a New Space

1. Go to [Hugging Face Spaces](https://huggingface.co/spaces)
2. Choose a name for your space (e.g., `content-classifier`)
3. Select **Docker** as the SDK
4. Set the space to **Public** or **Private** as needed
5. Click **Create Space**
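
If you prefer to script this step, the huggingface_hub client can create the Space as well. A minimal sketch, assuming you are already logged in (e.g., via `huggingface-cli login`) and with a placeholder repo id:

```python
from huggingface_hub import create_repo

# Create a Docker-SDK Space (the repo id below is a placeholder)
create_repo(
    "YOUR_USERNAME/content-classifier",
    repo_type="space",
    space_sdk="docker",
    private=False,
)
```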

### 2. Upload Files to Your Space

You need to upload these files to your Space repository:

```
contextClassifier.onnx    # Your ONNX model
app.py                    # FastAPI application
requirements.txt          # Python dependencies
Dockerfile                # Docker configuration
README.md                 # This will become the Space's README
```

### 3. Required Files Content

For the Space's README.md header, add this at the top:

```yaml
---
title: Content Classifier
emoji: 🔍
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit
app_port: 7860
---
```

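The Dockerfile for this layout might look like the sketch below. This assumes app.py exposes a FastAPI instance named `app` and is served with uvicorn; adjust it to match your actual application:

```dockerfile
FROM python:3.10-slim

WORKDIR /app

# Install dependencies first so this layer is cached across rebuilds
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the model and the application code
COPY contextClassifier.onnx .
COPY app.py .

# Hugging Face Spaces routes traffic to port 7860
EXPOSE 7860
CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "7860"]
```
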
### 4. Deployment Process

1. **Via Git (Recommended):**
   ```bash
   git clone https://huggingface.co/spaces/YOUR_USERNAME/YOUR_SPACE_NAME
   cd YOUR_SPACE_NAME
   
   # Copy your files into the repository
   cp contextClassifier.onnx .
   cp app.py .
   cp requirements.txt .
   cp Dockerfile .
   
   # Make sure the model is tracked by Git LFS before committing
   # (new Hub repos usually track *.onnx by default via .gitattributes)
   git lfs install
   git lfs track "*.onnx"
   
   # Commit and push
   git add .
   git commit -m "Add content classifier API"
   git push
   ```

2. **Via Web Interface:**
   - Use the **Files** tab in your Space
   - Upload each file individually, or drag and drop all files at once; for large files you can also push them with the client library (see the sketch below)
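
If the browser upload struggles with a large model file, the huggingface_hub client can push individual files to the Space programmatically. A minimal sketch, assuming you are logged in and with a placeholder repo id:

```python
from huggingface_hub import HfApi

api = HfApi()

# Upload the model directly into the Space repository
api.upload_file(
    path_or_fileobj="contextClassifier.onnx",
    path_in_repo="contextClassifier.onnx",
    repo_id="YOUR_USERNAME/YOUR_SPACE_NAME",
    repo_type="space",
)
```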

### 5. Monitor Deployment

1. Go to your Space URL: `https://huggingface.co/spaces/YOUR_USERNAME/YOUR_SPACE_NAME`
2. Check the **Logs** tab to monitor the build process
3. The Space will show a "Building" status during deployment
4. Once ready, you'll see the API documentation interface

### 6. Access Your API

Once deployed, your API will be available at:

- **Swagger UI:** `https://YOUR_USERNAME-YOUR_SPACE_NAME.hf.space/docs`
- **API Endpoints:**
  - `POST /predict` - Main prediction endpoint
  - `GET /health` - Health check
  - `GET /model-info` - Model information

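A quick smoke test once the Space is up (assuming the health endpoint in app.py returns JSON):

```bash
curl https://YOUR_USERNAME-YOUR_SPACE_NAME.hf.space/health
```
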
### 7. Example Usage

```python
import requests

# Replace with your actual Space URL
api_url = "https://YOUR_USERNAME-YOUR_SPACE_NAME.hf.space"

# Make a prediction
response = requests.post(
    f"{api_url}/predict",
    json={"text": "This is a test message"}
)

print(response.json())
```
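
The exact response shape depends on how app.py builds it; a classifier endpoint like this one typically returns something along these lines (field names are hypothetical):

```json
{"label": "safe", "confidence": 0.97}
```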

### 8. Important Notes

- **Model Size:** Make sure your `contextClassifier.onnx` file is under the Space's size limit
- **Cold Start:** The first request might take longer while the Space wakes up; see the retry sketch below
- **Logs:** Monitor the logs for any runtime errors
- **Updates:** Any push to the repository will trigger a rebuild
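
Because of cold starts, a first request can occasionally time out. A minimal retry sketch using the `requests` library (the timeout and retry counts are arbitrary choices):

```python
import time

import requests

api_url = "https://YOUR_USERNAME-YOUR_SPACE_NAME.hf.space"

def predict_with_retry(text: str, retries: int = 3, timeout: float = 60.0):
    """Call /predict, retrying with backoff to ride out a Space cold start."""
    for attempt in range(retries):
        try:
            response = requests.post(
                f"{api_url}/predict", json={"text": text}, timeout=timeout
            )
            response.raise_for_status()
            return response.json()
        except requests.RequestException:
            if attempt == retries - 1:
                raise
            time.sleep(2 ** attempt)  # simple exponential backoff

print(predict_with_retry("This is a test message"))
```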

### 9. Troubleshooting

**Common Issues:**

- **Build Fails:** Check the logs for dependency issues
- **Model Not Found:** Ensure `contextClassifier.onnx` is in the root directory
- **Port Issues:** Make sure the app uses port 7860
- **Memory Issues:** Large models might exceed memory limits

**Solutions:**

- Review `requirements.txt` for compatible versions
- Check the model file path in `app.py`
- Verify the Dockerfile exposes port 7860
- Consider optimizing the model for deployment, e.g., by quantizing it (see the sketch below)
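
If size or memory is the bottleneck, dynamic quantization with onnxruntime is one option. A sketch, assuming the standard onnxruntime quantization tooling; measure accuracy on your own data before shipping the quantized model:

```python
from onnxruntime.quantization import QuantType, quantize_dynamic

# Quantize weights to int8; this typically shrinks the model roughly 4x
# at the cost of a small, task-dependent accuracy drop.
quantize_dynamic(
    model_input="contextClassifier.onnx",
    model_output="contextClassifier.int8.onnx",
    weight_type=QuantType.QInt8,
)
```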