RT-J (`rt-j`)

RT-J is a Relational Transformer foundation model for in-context / few-shot entity prediction over multi-table relational databases (no per-task gradient training). This repository hosts both task-head variants:

Variant	Folder	Task	Metric	Selected checkpoint
Classifier	`classification/`	binary entity classification	AUROC ↑ (mean 0.7310 on 12 RelBench tasks)	SWA @ step 58,000 (best val AUROC)
Regressor	`regression/`	entity regression	MAE ↓ (mean 0.2677 on 9 RelBench tasks, Z-scored)	SWA @ step 18,000 (best val MAE)

Both variants share the same architecture and training recipe: ~85.6M params · bfloat16 · 12 blocks, d_model 512, 8 heads, d_ff 2048 · text columns embedded with all-MiniLM-L12-v2 (d_text 384). Each folder contains model.safetensors (weights) and config.json (dims + text-embedding model + provenance).

Usage

from huggingface_hub import hf_hub_download

# shared architecture + variant map (also what the Hub counts as a download)
config = hf_hub_download("stanford-star/rt-j", "config.json")

# pick the variant: "classification" or "regression"
weights = hf_hub_download("stanford-star/rt-j", "classification/model.safetensors")
variant_config = hf_hub_download("stanford-star/rt-j", "classification/config.json")

Training data: stanford-star/the-join
Evaluation: RelBench
More models from the team: stanford-star

Downloads last month: 3

Datasets used to train stanford-star/rt-j

Evaluation results

Mean AUROC (single-context, L=8k, full test split) on RelBench (12 binary classification tasks)
self-reported

0.731
Mean MAE (single-context, L=8k, full test split) on RelBench (9 regression tasks)
self-reported

0.268

stanford-star
/

rt-j

RT-J (`rt-j`)

Usage

Related

Datasets used to train stanford-star/rt-j

Evaluation results

RT-J (rt-j)

Usage

Related

Datasets used to train stanford-star/rt-j

Evaluation results

RT-J (`rt-j`)