RT-J (rt-j)

RT-J is a Relational Transformer foundation model for in-context / few-shot entity prediction over multi-table relational databases (no per-task gradient training). This repository hosts both task-head variants:

| Variant | Folder | Task | Metric | Selected checkpoint |
|---|---|---|---|---|
| Classifier | classification/ | binary entity classification | AUROC ↑ (mean 0.7310 on 12 RelBench tasks) | SWA @ step 58,000 (best val AUROC) |
| Regressor | regression/ | entity regression | MAE ↓ (mean 0.2677 on 9 RelBench tasks, Z-scored) | SWA @ step 18,000 (best val MAE) |
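
To make the regression metric concrete, here is a minimal sketch of a Z-scored MAE. It assumes the targets are standardized with the mean and standard deviation of each task's training labels and that predictions are on the raw target scale; this card does not spell out the normalization, so treat both as assumptions. The function name `zscored_mae` is hypothetical and is not part of this repository.

```python
from statistics import fmean, pstdev


def zscored_mae(y_true, y_pred, train_targets):
    """MAE after standardizing targets and predictions with train-set statistics.

    Assumption: the z-scoring uses the mean and (population) standard deviation
    of the task's training labels. The model card does not state this.
    """
    mu = fmean(train_targets)
    sigma = pstdev(train_targets)
    if sigma == 0:
        raise ValueError("training targets have zero variance")
    # Standardize both sides with the same statistics, then average the
    # absolute differences. The mean cancels, so this equals |t - p| / sigma.
    return fmean(
        abs((t - mu) / sigma - (p - mu) / sigma)
        for t, p in zip(y_true, y_pred)
    )
```

Under this reading, an error of one training-set standard deviation scores 1.0, which is what allows a single mean to be taken over tasks whose targets have different units.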

Both variants share the same architecture and training recipe: ~85.6M params · bfloat16 · 12 blocks, d_model 512, 8 heads, d_ff 2048 · text columns embedded with all-MiniLM-L12-v2 (d_text 384). Each folder contains model.safetensors (weights) and config.json (dims + text-embedding model + provenance).

Usage

from huggingface_hub import hf_hub_download

# shared architecture + variant map (also what the Hub counts as a download)
config = hf_hub_download("stanford-star/rt-j", "config.json")

# pick the variant: "classification" or "regression"
weights = hf_hub_download("stanford-star/rt-j", "classification/model.safetensors")
variant_config = hf_hub_download("stanford-star/rt-j", "classification/config.json")
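
As a quick sanity check on a downloaded checkpoint, the sketch below reads only the header of a `.safetensors` file and counts parameters, without loading tensor data or importing a deep-learning framework. It relies on the documented safetensors layout (an 8-byte little-endian header length followed by a JSON header); the helper names `read_safetensors_header` and `count_parameters` are illustrative and not part of this repository.

```python
import json
import struct
from math import prod


def read_safetensors_header(path):
    """Return the per-tensor JSON header of a .safetensors file.

    Layout: 8 bytes (unsigned little-endian 64-bit header length N),
    then N bytes of UTF-8 JSON mapping tensor names to dtype/shape/offsets.
    """
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    header.pop("__metadata__", None)
    return header


def count_parameters(header):
    """Sum the element counts of all tensors listed in the header."""
    # prod([]) == 1, so scalar tensors count as one parameter.
    return sum(prod(entry["shape"]) for entry in header.values())
```

With the `weights` path from the snippet above, `count_parameters(read_safetensors_header(weights))` should land near the ~85.6M parameters stated earlier, assuming the checkpoint stores no extra buffers. The `variant_config` path can be opened with `json.load` in the usual way.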

Evaluation results

  • Mean AUROC (single-context, L=8k, full test split) on RelBench (12 binary classification tasks): 0.731 (self-reported)
  • Mean MAE (single-context, L=8k, full test split) on RelBench (9 regression tasks): 0.268 (self-reported)