RT-J Classifier (rt-j-clf)

The classification checkpoint of RT-J — a Relational Transformer foundation model for in-context / few-shot binary entity classification over multi-table relational databases (no per-task gradient training).

  • Task type: binary classification (metric: AUROC ↑)
  • Selected: SWA checkpoint at step 58,000 (best on validation AUROC)
  • Params: ~85.6M · dtype: bfloat16 · 12 blocks, d_model 512, 8 heads, d_ff 2048
  • Files: model.safetensors (weights), config.json (dims + text-embedding model + provenance)
from rt.checkpoints import load_rt_model
model, config = load_rt_model("star-project/rt-j-clf", device="cuda")

Full model card, training details, evaluation, license, and citation: see the RT-J repository. Use star-project/rt-j-reg for regression.

Downloads last month
-
Safetensors
Model size
85.6M params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Datasets used to train stanford-star/rt-j-clf

Evaluation results

  • Mean AUROC (single-context, L=8k, full test split) on RelBench (12 binary classification tasks)
    self-reported
    0.731