Instructions to use ArshVerma/mannsaathi-symptom-classifier-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ArshVerma/mannsaathi-symptom-classifier-large with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="ArshVerma/mannsaathi-symptom-classifier-large")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ArshVerma/mannsaathi-symptom-classifier-large", dtype="auto") - PEFT
How to use ArshVerma/mannsaathi-symptom-classifier-large with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
Model Card for MannSaathi Symptom Classifier (Large)
A multilingual symptom classification model fine-tuned to understand how people in India actually describe health concerns — in Hindi, English, and Hinglish. Built as the AI core of MannSaathi, an anonymous health companion designed to help people overcome fear and hesitation around seeking healthcare.
Model Details
Model Description
This model classifies free-text symptom descriptions into 22+ symptom categories and outputs a calibrated risk score. It was fine-tuned using LoRA (Low-Rank Adaptation) on top of xlm-roberta-large, specifically trained to handle code-mixed Hindi-English (Hinglish) input, which most general-purpose medical NLP models fail to interpret correctly.
The model intentionally avoids alarming or diagnostic language in its design philosophy — it is built to inform and reassure, not to diagnose.
- Developed by: Arsh Verma
- Model type: Multi-label text classification (fine-tuned with LoRA)
- Language(s): Hindi, English, Hinglish (code-mixed)
- License: MIT
- Finetuned from model: xlm-roberta-large
Model Sources
- Repository: https://github.com/ArshVermaGit/mannsaathi
- Demo: https://mannsaathi-mukk.onrender.com
- Video walkthrough: https://www.youtube.com/watch?v=c58bVmaaMQ0
Uses
Direct Use
This model is intended to power conversational symptom-checking experiences where users describe how they're feeling in natural, informal language. It returns likely symptom categories and a risk indicator to help guide users toward an appropriate next step (self-care info, telemedicine, or in-person care).
Out-of-Scope Use
This model is not a diagnostic tool and must not be used as a substitute for professional medical advice, diagnosis, or treatment. It should not be deployed in any clinical decision-making system without human oversight. It is not validated for emergency triage.
Bias, Risks, and Limitations
The model was trained on a dataset skewed toward Hindi-English code-mixed text common in North Indian conversational patterns, and may underperform on other Indian languages or regional dialects. As with any symptom-classification model, false negatives on serious symptoms are possible — this model should always be paired with clear messaging encouraging users to seek professional care when uncertain.
Recommendations
Users (and downstream developers) should be made aware of the risks, biases, and limitations of the model. Always pair model output with clear next-step guidance and never present results as a clinical diagnosis.
How to Get Started with the Model
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
tokenizer = AutoTokenizer.from_pretrained("ArshVerma/mannsaathi-symptom-classifier-large")
model = AutoModelForSequenceClassification.from_pretrained("ArshVerma/mannsaathi-symptom-classifier-large")
text = "sir dard hai aur neend nahi aa rahi 3 din se"
inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
with torch.no_grad():
outputs = model(**inputs)
probs = torch.sigmoid(outputs.logits)
print(probs)
Training Details
Training Data
The model was trained on a custom-curated dataset of Hindi, English, and Hinglish health-related text, combining publicly available medical Q&A sources with manually labeled conversational symptom descriptions reflecting how people in India actually talk about their health.
Training Procedure
Fine-tuning was performed using LoRA (Low-Rank Adaptation) on top of xlm-roberta-large, allowing efficient training on limited GPU resources without updating all base model parameters.
Training Hyperparameters
- Training regime: Mixed precision (fp16)
- Base model: xlm-roberta-large
- Fine-tuning method: LoRA
- Hardware: Kaggle free-tier T4 GPU
Evaluation
Testing Data, Factors & Metrics
The model was evaluated on a held-out validation split of the curated dataset, measuring multi-label classification accuracy and F1 score across all 22+ symptom categories.
Metrics
- Accuracy
- F1 Score (macro-averaged)
Environmental Impact
Training was performed on a single Kaggle T4 GPU instance over a limited number of epochs, resulting in minimal estimated carbon impact compared to large-scale pretraining runs.
- Hardware Type: NVIDIA T4 GPU (Kaggle free tier)
- Cloud Provider: Kaggle (Google Cloud-backed)
- Compute Region: Unknown
Technical Specifications
Model Architecture and Objective
XLM-RoBERTa-large backbone with a fine-tuned classification head (via LoRA adapters) for multi-label symptom category prediction.
Compute Infrastructure
Hardware
Kaggle free-tier NVIDIA T4 GPU
Software
transformerspeft(for LoRA)torch
Citation
If you use this model, please consider citing the project:
@misc{mannsaathi2026,
author = {Arsh Verma},
title = {MannSaathi Symptom Classifier (Large)},
year = {2026},
url = {https://huggingface.co/ArshVerma/mannsaathi-symptom-classifier-large}
}
Model Card Contact
Model tree for ArshVerma/mannsaathi-symptom-classifier-large
Base model
FacebookAI/xlm-roberta-large