YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

QuADMET-Former

Pre-training on Quantum Mechanical Properties Improves ADMET Prediction

Paper GitHub License: MIT

Overview

QuADMET-Former demonstrates that pre-training Equivariant Graph Neural Networks (EGNNs) on quantum mechanical properties significantly improves the prediction of absorption, distribution, metabolism, excretion, and toxicological (ADMET) properties of small molecules.

Key contributions:

  • Pre-training TorchMD-NET on quantum mechanical properties (dipole moments, HOMO-LUMO gap, partial charges)
  • Novel E3FP pre-training strategy using 3D molecular fingerprints
  • Comprehensive evaluation on 23 ADMET benchmarks with challenging spectral splits
  • State-of-the-art performance: best on 16/23 datasets

Installation

Prerequisites

  • Python 3.10+
  • CUDA 12.1+ (for GPU support)

Setup

# Clone the repository
git clone https://github.com/your-username/quadmetformer.git
cd quadmetformer

# Create conda environment
conda env create -f environment.yml
conda activate quadmetformer

# Or install with pip
pip install -e .

Pre-trained Models

Pre-trained checkpoints are available on HuggingFace: https://huggingface.co/arunraja007/quadmetformer/tree/main

Usage

Pre-training

Pre-train on QMugs dataset with quantum mechanical targets:

cd pretraining
python pretrain.py --config configs/pretraining/qmugs.json

Fine-tuning on ADMET Tasks

Fine-tune a pre-trained model on downstream ADMET prediction:

cd finetuning
python downstream_train.py \
    --dataset Lipophilicity_AstraZeneca \
    --pretrained_ckpt path/to/checkpoint.ckpt \
    --split_type spectral

Reproducing Results

To reproduce the main results from the paper:

# 1. Pre-train on QMugs (or download pre-trained checkpoints)
cd pretraining
python pretrain.py --config configs/pretraining/qmugs.json

# 2. Run fine-tuning on all benchmarks
cd finetuning
python downstream_train.py --config configs/qmugs_single_conf.json --dataset all

# 3. Run baseline comparisons
python baseline_downstream_train.py --dataset all

Citation

If you find this work useful, please cite the following papers:

@article{arunraja2026quadmetformer,
author = {Arun Raja  and Hongtao Zhao  and Christian Tyrchan  and Eva Nittinger  and Michael M. Bronstein  and Charlotte M. Deane  and Garrett M. Morris },
title = {QuADMET-Former: Pre-training on Quantum Mechanical Properties Improves ADMET Prediction},
journal = {ChemRxiv},
volume = {2026},
number = {0427},
year = {2026},
doi = {10.26434/chemrxiv.15002429/v1}}
@inproceedings{
raja2024on,
title={On the Effectiveness of Quantum Chemistry Pre-training for Pharmacological Property Prediction},
author={Arun Raja and Hongtao Zhao and Christian Tyrchan and Eva Nittinger and Michael M. Bronstein and Charlotte Deane and Garrett M Morris},
booktitle={ICML 2024 AI for Science Workshop},
year={2024},
url={https://openreview.net/forum?id=ffSLXl666Q}
}

Acknowledgements

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support