RT-J Classifier (`rt-j-clf`)

The classification checkpoint of RT-J — a Relational Transformer foundation model for in-context / few-shot binary entity classification over multi-table relational databases (no per-task gradient training).

Task type: binary classification (metric: AUROC ↑)
Selected: SWA (stochastic weight averaging) checkpoint at step 58,000, chosen for the best validation AUROC
Params: ~85.6M · dtype: bfloat16 · 12 blocks, d_model 512, 8 heads, d_ff 2048
Files: model.safetensors (weights), config.json (dims + text-embedding model + provenance)
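As a rough sanity check, the figures above imply a weight footprint and a per-head width. This is only a back-of-envelope sketch: the parameter count is the rounded "~85.6M" from this card, and the per-head width assumes `d_model` is split evenly across heads, which the card does not state explicitly.

```python
# Back-of-envelope numbers derived from the figures listed on this card.
N_PARAMS = 85.6e6       # "~85.6M" as stated above (rounded, so results are approximate)
BYTES_PER_BF16 = 2      # bfloat16 stores each value in 16 bits

# Size of the weights alone; activations and context data are not included.
weight_bytes = N_PARAMS * BYTES_PER_BF16
print(f"weights: {weight_bytes / 1e6:.1f} MB ({weight_bytes / 2**20:.1f} MiB)")

# Assumption: d_model is divided evenly across attention heads.
D_MODEL, N_HEADS = 512, 8
head_dim = D_MODEL // N_HEADS
print(f"head_dim: {head_dim}")
```

The estimate covers stored weights only; actual GPU memory use at inference time also depends on context length and batch size.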

```python
from rt.checkpoints import load_rt_model
model, config = load_rt_model("star-project/rt-j-clf", device="cuda")
```

For the full model card, training details, evaluation, license, and citation, see the RT-J repository. For regression tasks, use `star-project/rt-j-reg` instead.
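If a pipeline handles both task types, the choice of checkpoint can be made explicit. The sketch below is illustrative only: `CHECKPOINTS`, `checkpoint_for`, and `load_for_task` are hypothetical helpers, not part of the `rt` package; the repository IDs and the `load_rt_model(repo_id, device=...)` call are taken from this card.

```python
# Repository IDs copied from this card; the helpers below are hypothetical.
CHECKPOINTS = {
    "classification": "star-project/rt-j-clf",
    "regression": "star-project/rt-j-reg",
}


def checkpoint_for(task_type: str) -> str:
    """Return the checkpoint repository ID for a given task type."""
    try:
        return CHECKPOINTS[task_type]
    except KeyError:
        raise ValueError(
            f"unknown task type {task_type!r}; expected one of {sorted(CHECKPOINTS)}"
        ) from None


def load_for_task(task_type: str, device: str = "cuda"):
    """Load the checkpoint matching the task type via load_rt_model."""
    # Imported lazily so checkpoint_for() works without the rt package installed.
    from rt.checkpoints import load_rt_model

    return load_rt_model(checkpoint_for(task_type), device=device)
```

For example, `load_for_task("classification")` would resolve to the same call shown above, while an unrecognized task type fails early with a `ValueError` rather than attempting a download.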

Evaluation results (self-reported)

Mean AUROC on RelBench (12 binary classification tasks; single-context, L=8k, full test split): 0.731
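To make the metric concrete, here is a minimal sketch of how a mean AUROC over several tasks can be computed. It assumes the reported figure is an unweighted mean of per-task AUROC values, which this card does not spell out. The task names and scores are toy data for illustration only, not RelBench results, and `auroc` is a plain pairwise implementation rather than the evaluation code used for RT-J.

```python
from statistics import mean


def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical tasks), NOT results from RelBench or RT-J.
toy_tasks = {
    "task_a": ([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]),
    "task_b": ([0, 1, 0, 1], [0.2, 0.9, 0.3, 0.7]),
}

per_task = {name: auroc(y, s) for name, (y, s) in toy_tasks.items()}
# Assumption: the summary metric is an unweighted mean across tasks.
mean_auroc = mean(list(per_task.values()))
print(per_task, mean_auroc)
```

The pairwise form is quadratic in the number of examples, so it is suitable for illustration; a rank-based implementation would be preferable on full test splits.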
