Supertron2-Reranker-8B: A Compact Cross-Encoder Reranking Model

Model Description

Supertron2-Reranker-8B is a reranking model built on top of Qwen/Qwen3-VL-Reranker-8B. It is designed to score query-document pairs for retrieval pipelines, search systems, and RAG applications where a stronger second-stage ranker is useful.

  • Developed by: Surpem
  • Model type: Cross-Encoder Reranker
  • Architecture: Qwen3-VL reranker, 8B parameters
  • License: Apache 2.0

Capabilities

Search Reranking

Supertron2-Reranker-8B can compare a user query against candidate passages and assign relevance scores. It is intended as a second-stage reranker after a faster retriever has already selected candidate documents.

RAG Pipelines

The model can help improve retrieval-augmented generation by pushing more relevant documents toward the top of the context window before answer generation.

Question-Document Matching

Supertron2-Reranker-8B is useful for matching questions to passages, snippets, help-center articles, documentation chunks, and other text candidates.

Instruction-Aware Retrieval

The model is prompted for relevance scoring, making it suitable for natural language search tasks where query intent matters.


Get Started

from sentence_transformers import CrossEncoder

model_id = "Surpem/Supertron2-Reranker-8B"

model = CrossEncoder(model_id)

pairs = [
    ("What is the capital of France?", "Paris is the capital and largest city of France."),
    ("What is the capital of France?", "Mars is often called the red planet."),
]

scores = model.predict(pairs)
print(scores)

Example reranking:

query = "How do I reset my password?"
documents = [
    "Use the account recovery page to reset your password.",
    "Our refund policy allows returns within 30 days.",
    "Two-factor authentication adds extra login security.",
]

results = model.rank(query, documents)
print(results)

Hardware Requirements

Precision Min VRAM Recommended
bfloat16 18 GB 24 GB+
4-bit quantized 6 GB 10 GB+

For larger batches or long documents, use more VRAM or reduce the batch size/max sequence length.


Intended Use

Supertron2-Reranker-8B is intended for:

  • Search reranking
  • RAG document reranking
  • Query-passage relevance scoring
  • Documentation and knowledge-base retrieval
  • Evaluation of candidate retrieval results

It is not intended to be used as a standalone chat model.


Limitations

  • The model scores relevance; it does not generate answers.
  • It should be evaluated on your own retrieval domain before production use.
  • Long documents may need chunking before reranking.
  • Relevance scores are relative and may not be calibrated across unrelated queries.
  • The model may still rank incorrect, outdated, or unsafe content highly if it appears textually relevant.

Citation

@misc{surpem2026supertron2-reranker-8b,
      title={Supertron2-Reranker-8B -- Compact Cross-Encoder Reranking Model},
      author={Surpem},
      year={2026},
      url={https://huggingface.co/Surpem/Supertron2-Reranker-8B},
}
Downloads last month
12
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Surpem/Supertron2-Reranker-8B

Finetuned
(1)
this model

Collection including Surpem/Supertron2-Reranker-8B