Text Classification
Transformers
Safetensors
English
deberta-v2
scientific-claim-verification
scifact
evidence-aware-verification
text-embeddings-inference
Instructions to use rishhh/verisci-claim-verifier with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use rishhh/verisci-claim-verifier with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="rishhh/verisci-claim-verifier")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("rishhh/verisci-claim-verifier") model = AutoModelForSequenceClassification.from_pretrained("rishhh/verisci-claim-verifier") - Notebooks
- Google Colab
- Kaggle
VeriSci Claim Verifier
DeBERTa v3 small sequence classifier for evidence-aware scientific claim verification over SUPPORTS, REFUTES, and NOT_ENOUGH_INFO.
Intended Use
Given a scientific claim and retrieved evidence passage, classify whether the evidence supports, refutes, or does not provide enough information. This model is not a medical device, not a substitute for peer review, and should not be used for clinical, legal, or public-policy decisions without expert review.
Training Data
- Primary verifier data:
allenai/scifact_entailmenttrain split. - Hard NEI examples:
andreiaalexa/scifact-relevance-pairstitle/trainnot_relevantpairs. - Validation/test policy: train-overlapping validation claims removed, then remaining validation rows partitioned by normalized claim key. Derived test was not used for training or model selection.
Evaluation
Validation accuracy: 0.4464 Validation macro F1: 0.4349
Derived test accuracy: 0.4024 Derived test macro F1: 0.3887
See evaluation/eval_summary.json and prediction JSONL files for class metrics, confusion matrices, and limitations.
Limitations
- The derived test split is small and originates from SciFact validation, not an official hidden test set.
- The model only judges the provided evidence; retrieval failures can change the end-to-end answer.
- Raw softmax confidence is not a calibrated probability.
- Training data is SciFact-derived and subject to license/domain constraints.
- Downloads last month
- 55