YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

DPA4-MatPES

DPA4-MatPES provides a series of DPA4 / SeZM machine-learning interatomic potentials trained on the MatPES R2SCAN 2025.2 dataset. The models are SO(3)-equivariant and conservative, and span the full periodic table. They are intended for inorganic materials within the chemical space covered by MatPES R2SCAN.

Overview

General information:

Item Value
Architecture DPA4 / SeZM
Backend PyTorch only
Precision float32
Elements Full periodic table (H–Og)
Cutoff radius 6.0 Å
Training data MatPES R2SCAN 2025.2 (347,889 frames)
Validation data MatPES R2SCAN 2025.2 (19,328 frames)
Trained with DeePMD-kit 3.2.0, PyTorch 2.11 / CUDA 12.8

Model variants:

Model Parameters
DPA4-Mini 655,504 (0.655 M)
DPA4-Neo 1,125,372 (1.125 M)
DPA4-Air 5,148,611 (5.149 M)

Released files:

File Description
DPA4-Mini-MatPES-v20260628.pt DPA4-Mini checkpoint.
DPA4-Mini-MatPES-v20260628.json DPA4-Mini training configuration.
DPA4-Neo-MatPES-v20260628.pt DPA4-Neo checkpoint.
DPA4-Neo-MatPES-v20260628.json DPA4-Neo training configuration.
DPA4-Air-MatPES-v20260628.pt DPA4-Air checkpoint.
DPA4-Air-MatPES-v20260628.json DPA4-Air training configuration.

The models predict atomic energies, from which forces and virials are obtained by differentiation:

$\qquad \mathbf{F}_i = -\frac{\partial E}{\partial \mathbf{r}_i}$

The descriptor propagates vector and higher-order angular (SO(3)-equivariant) features through message passing and reads out only the final l = 0 scalar to the energy fitting network. The architecture is described in the DeePMD-kit DPA4 documentation.

Model architecture

Names in parentheses are the corresponding DeePMD-kit configuration keys. The complete configurations, including the full type_map, are given in the released input files.

Network dimensions

Parameter DPA4-Mini DPA4-Neo DPA4-Air Description
Feature channels (channels) 32 32 64 Total channels per (l, m) coefficient.
Focus streams (n_focus) 1 2 1 Parallel focus streams inside the SO(2) convolution.
Interaction blocks (n_blocks) 2 2 3 Number of message-passing blocks.
SO(2) layers per block (so2_layers) 3 3 4 Number of SO(2) mixing layers per block.
FFN blocks (ffn_blocks) 1 1 1 Number of FFN sublayers per interaction block.
Environment seeding (use_env_seed) True True True Seed the initial node state with local-environment information via environment-matrix FiLM conditioning.

Radial basis and angular resolution

Parameter DPA4-Mini DPA4-Neo DPA4-Air Description
Radial basis (basis_type) bessel bessel bessel Radial basis type (bessel or gaussian).
Radial functions (n_radial) 16 16 16 Number of radial basis functions.
Max angular degree (lmax) 2 3 3 Maximum degree of the equivariant representation.
Max SO(2) order (mmax) 1 1 1 Maximum SO(2) order.

Equivariant interaction

Parameter DPA4-Mini DPA4-Neo DPA4-Air Description
Radial degree mixer (radial_so2_mode) degree_channel degree_channel degree_channel Dynamic radial degree-mixer mode in the SO(2) convolution; degree_channel uses a per-channel cross-degree radial kernel.
Mixer rank (radial_so2_rank) 1 1 1 Low-rank factorization rank for the degree_channel kernel (0 = full per-channel kernel).
Attention heads (n_atten_head) 1 1 1 Heads aggregating messages in the SO(2) convolution (0 = envelope-weighted scatter-sum).
SO(3) message branch (message_node_so3) True True True Enable the post-aggregation SO(3) Wigner-D grid-net branch (message as query, node state as context).
Normalization placement (sandwich_norm) [false, true, true, false] [false, true, true, false] [false, true, true, false] Pre/post-norm switches for residual branches, ordered [so2_pre, so2_post, ffn_pre, ffn_post].

Feed-forward and read-out

Parameter DPA4-Mini DPA4-Neo DPA4-Air Description
Activation (activation_function) silu silu silu Base activation for helper MLPs, the SO(2) gated path, and the output FFN.
GLU (glu_activation) True True True Base GLU switch for the FFN (e.g. SiLU → SwiGLU).
FFN width (ffn_neurons) 0 (auto) 0 (auto) 0 (auto) Hidden width for block and output FFNs; 0 resolves the width from channels.
FFN SO(3) grid (ffn_so3_grid) True True True Use the Wigner-D SO(3) grid in the block-internal FFN.
Grid-branch routing (grid_branch) [0, 0, 1] [0, 0, 1] [0, 0, 1] Scalar-routed polynomial product branches per grid path, ordered [node_wise, message_node, ffn].
SO(3) read-out (so3_readout) mlp mlp mlp Read-out mode for the final l=0 descriptor; mlp folds l>0 geometry into l=0 via a point-wise SO(3) grid MLP.
Fitting hidden width (fitting_net.neuron) [0] (auto) [0] (auto) [0] (auto) Fitting-net hidden-layer widths; 0 is an auto-width placeholder resolved from the descriptor width.

Cut-off and model size

Parameter DPA4-Mini DPA4-Neo DPA4-Air Description
Cut-off radius (rcut) 6.0 Å 6.0 Å 6.0 Å Neighbor cut-off radius.
Total parameters 655,504 (0.655 M) 1,125,372 (1.125 M) 5,148,611 (5.149 M) Trainable parameter count.

Precision and compilation

Parameter Value Description
Compute precision (precision) float32 Precision of the descriptor parameters.
Mixed precision (use_amp) True Automatic mixed precision with bfloat16 on CUDA during training.
TF32 matmul (enable_tf32) True TF32 matmul precision for CUDA training forwards (eval-time TF32 is controlled separately).
Compiled path (use_compile) True Use the compiled DPA4/SeZM training path.

Training

The models were trained with the conservative energy objective using dp --pt train.

Setting DPA4-Mini DPA4-Neo DPA4-Air
Epochs 300 250 200
Batch size filter:12000 filter:3000 filter:2000
Learning rate WSD schedule, start_lr = 1e-3stop_lr = 1e-6 WSD schedule, start_lr = 5e-4stop_lr = 1e-6 WSD schedule, start_lr = 3.5e-4stop_lr = 1e-6
Decay phase cosine decay over 65% of the schedule cosine decay over 65% of the schedule cosine decay over 65% of the schedule
Optimizer HybridMuon (weight_decay = 0.001) HybridMuon (weight_decay = 0.001) HybridMuon (weight_decay = 0.001)
Loss MAE energy loss with prefactors pref_e = 20, pref_f = 20, pref_v = 5 MAE energy loss with prefactors pref_e = 20, pref_f = 20, pref_v = 5 MAE energy loss with prefactors pref_e = 20, pref_f = 20, pref_v = 5

Validation

Errors are reported on the MatPES R2SCAN validation set.

Model Energy MAE/atom Energy RMSE/atom Force MAE Force RMSE Stress MAE Stress RMSE Training hoursᵃ
DPA4-Mini 21.99 meV 55.18 meV 110.23 meV/Š259.77 meV/Š3.830 meV/ų 8.236 meV/ų 9.9
DPA4-Neo 17.70 meV 52.91 meV 95.91 meV/Š225.23 meV/Š3.167 meV/ų 7.047 meV/ų 14.6
DPA4-Air 15.34 meV 50.95 meV 93.27 meV/Š225.06 meV/Š2.968 meV/ų 6.608 meV/ų 19.5
  • ᵃ Training hours are measured on a single NVIDIA H20 GPU.

Usage

DPA4/SeZM is a PyTorch-only model and uses the .pt2 (AOTInductor) export path; the TorchScript freeze path does not apply. Usage follows the DeePMD-kit DPA4 documentation.

Installation

DPA4/SeZM is available in the DeePMD-kit main branch (version 3.2.0). Install the latest code together with PyTorch 2.11 and CUDA 12.8:

pip install torch
pip install git+https://github.com/deepmodeling/deepmd-kit

For offline or backend-specific installation, see the DeePMD-kit installation guide. The compiled inference path (DP_COMPILE_INFER) additionally requires torch >= 2.11 and CUDA >= 12.6.

Evaluate a checkpoint

Run dp test on a released checkpoint with data in the standard or mixed-type DeePMD-kit data format. Replace <Mini|Neo|Air> with the selected model variant:

dp --pt test -m DPA4-<Mini|Neo|Air>-MatPES-v20260628.pt -s /path/to/test/system -n 1000

Freeze to .pt2 for deployment

dp --pt freeze -c DPA4-<Mini|Neo|Air>-MatPES-v20260628.pt -o frozen_model

The PyTorch backend detects DPA4/SeZM and writes frozen_model.pt2. The .pt2 is an AOTInductor archive and is target-specific: it depends on the host CPU/GPU, the GPU compute capability, and the libtorch version. Freeze on the target machine rather than reusing a .pt2 across different hardware.

Inference precision is fixed at freeze time. The .pt2 is a forward-only AOTInductor package, so the SO(2)-rotation kernel and matmul precision are compiled into the graph at export and are not re-evaluated when the file is later loaded by ASE or LAMMPS. Set the relevant environment variables before running dp --pt freeze:

Variable Default Effect
DP_TF32_INFER 0 (highest) float32 matmul precision: 0 highest, 1 high, 2 medium. Keep 0 for MD and other PES-smoothness-sensitive workflows.
DP_TRITON_INFER off Fused Triton SO(2) rotation kernel (CUDA). Retains full float32 accumulation, so it is safe for MD.

Accepted boolean values: 1/true/yes/on and 0/false/no/off.

Run in LAMMPS

The frozen .pt2 is used through pair_style deepmd. A minimal setup:

units           metal
atom_style      atomic
atom_modify     map yes

neighbor        2.0 bin
read_data       system.lmp

pair_style      deepmd frozen_model.pt2
pair_coeff      * * O H

atom_modify map yes is required: .pt2 graph inference relies on an explicit ghost/periodic-image to local-atom map, and the model fails fast if the map is absent. The element names after pair_coeff * * bind LAMMPS atom types to entries of the model's type_map in order (here types 1 and 2 to O and H); if omitted, the mapping falls back to the type_map stored in the .pt2 metadata.

Multi-GPU (MPI) inference uses the same .pt2. Launch one MPI rank per GPU and make every target device visible:

CUDA_VISIBLE_DEVICES=0,1,2,3 mpirun -np 4 lmp -in in.lammps

Run with ASE

from ase.io import read
from deepmd.calculator import DP

atoms = read("structure.cif")
atoms.calc = DP(model="frozen_model.pt2")

energy = atoms.get_potential_energy()
forces = atoms.get_forces()
stress = atoms.get_stress()

Adjust the energy bias for a new system

DFT energy labels differ between datasets by an arbitrary per-element constant. Before evaluating or simulating a system whose energy reference differs from MatPES R2SCAN, the per-element energy bias can be refit to that system without retraining any network weights, which often improves energy accuracy:

dp --pt change-bias DPA4-<Mini|Neo|Air>-MatPES-v20260628.pt -s /path/to/system

This updates only the energy shift and writes an adjusted checkpoint; the descriptor and fitting-net weights are unchanged.

Fine-tune on a downstream dataset

The released checkpoints serve as pretrained initializations. Start from the corresponding released input file, keep the model section (descriptor and fitting net) unchanged, replace type_map and the training/validation data with the downstream dataset, and use a small learning rate (e.g. start_lr = 1e-4):

dp --pt train input_finetune.json --finetune DPA4-<Mini|Neo|Air>-MatPES-v20260628.pt

DPA4/SeZM also supports LoRA adapters for single-task fine-tuning; the best checkpoints fold the LoRA deltas back into the base weights, producing a plain DPA4/SeZM checkpoint suitable for deployment.

Intended use and limitations

  • Intended for energy, force, and stress prediction of inorganic materials within the chemistry and configuration space covered by MatPES R2SCAN. Accuracy outside this domain should be validated before use.
  • PyTorch backend only; export is .pt2 (AOTInductor), and model compression is not supported.
  • Multi-GPU (MPI) LAMMPS inference applies to the plain energy model; ZBL zone bridging and spin models run on a single MPI rank.

Citation

If you use these models, please cite the DPA4 paper, DeePMD-kit, and the MatPES R2SCAN dataset.

@article{li2026dpa4,
  title   = {{DPA4}: Pushing the Accuracy-Cost Frontier of Interatomic
             Potentials with {EMFA} {SO(2)} Convolution},
  author  = {Li, Tiancheng and Li, Wentao and Peng, Anyang and Xue, Jianming
             and Zhang, Linfeng and Zhang, Duo and Wang, Han},
  journal = {arXiv preprint arXiv:2606.02419},
  year    = {2026},
  doi     = {10.48550/arXiv.2606.02419},
  url     = {https://arxiv.org/abs/2606.02419}
}

@article{Wang_ComputPhysCommun_2018_v228_p178,
  author    = {Wang, Han and Zhang, Linfeng and Han, Jiequn and E, Weinan},
  title     = {{DeePMD-kit: A deep learning package for many-body potential
               energy representation and molecular dynamics}},
  journal   = {Comput. Phys. Comm.},
  volume    = {228},
  pages     = {178--184},
  year      = {2018},
  doi       = {10.1016/j.cpc.2018.03.016}
}

@article{Zeng_JChemPhys_2023_v159_p054801,
  author    = {Jinzhe Zeng and Duo Zhang and Denghui Lu and Pinghui Mo and Zeyu
               Li and Yixiao Chen and Mari{\'a}n Rynik and Li'ang Huang and Ziyao
               Li and Shaochen Shi and Yingze Wang and Haotian Ye and Ping Tuo
               and Jiabin Yang and Ye Ding and Yifan Li and Davide Tisi and Qiyu
               Zeng and Han Bao and Yu Xia and Jiameng Huang and Koki Muraoka and
               Yibo Wang and Junhan Chang and Fengbo Yuan and Sigbj{\o}rn
               L{\o}land Bore and Chun Cai and Yinnian Lin and Bo Wang and Jiayan
               Xu and Jia-Xin Zhu and Chenxing Luo and Yuzhi Zhang and Rhys E A
               Goodall and Wenshuo Liang and Anurag Kumar Singh and Sikai Yao and
               Jingchao Zhang and Renata Wentzcovitch and Jiequn Han and Jie Liu
               and Weile Jia and Darrin M York and Weinan E and Roberto Car and
               Linfeng Zhang and Han Wang},
  title     = {{DeePMD-kit v2: A software package for deep potential models}},
  journal   = {J. Chem. Phys.},
  volume    = {159},
  issue     = {5},
  pages     = {054801},
  year      = {2023},
  doi       = {10.1063/5.0155600}
}

@article{Zeng_JChemTheoryComput_2025_v21_p4375,
  author    = {Jinzhe Zeng and Duo Zhang and Anyang Peng and Xiangyu Zhang and
               Sensen He and Yan Wang and Xinzijian Liu and Hangrui Bi and Yifan
               Li and Chun Cai and Chengqian Zhang and Yiming Du and Jia-Xin Zhu
               and Pinghui Mo and Zhengtao Huang and Qiyu Zeng and Shaochen Shi
               and Xuejian Qin and Zhaoxi Yu and Chenxing Luo and Ye Ding and
               Yun-Pei Liu and Ruosong Shi and Zhenyu Wang and Sigbj{\o}rn
               L{\o}land Bore and Junhan Chang and Zhe Deng and Zhaohan Ding and
               Siyuan Han and Wanrun Jiang and Guolin Ke and Zhaoqing Liu and
               Denghui Lu and Koki Muraoka and Hananeh Oliaei and Anurag Kumar
               Singh and Haohui Que and Weihong Xu and Zhangmancang Xu and
               Yong-Bin Zhuang and Jiayu Dai and Timothy J. Giese and Weile Jia
               and Ben Xu and Darrin M. York and Linfeng Zhang and Han Wang},
  title     = {{DeePMD-kit v3: A Multiple-Backend Framework for Machine Learning
               Potentials}},
  journal   = {J. Chem. Theory Comput.},
  volume    = {21},
  number    = {9},
  pages     = {4375--4385},
  year      = {2025},
  doi       = {10.1021/acs.jctc.5c00340}
}

@article{Kaplan_arXiv_2025_2503_04070,
  title         = {A Foundational Potential Energy Surface Dataset for Materials},
  author        = {Kaplan, Aaron D. and Liu, Runze and Qi, Ji and Ko, Tsz Wai
                   and Deng, Bowen and Riebesell, Janosh and Ceder, Gerbrand
                   and Persson, Kristin A. and Ong, Shyue Ping},
  journal       = {arXiv preprint arXiv:2503.04070},
  year          = {2025},
  doi           = {10.48550/arXiv.2503.04070},
  url           = {https://arxiv.org/abs/2503.04070}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Papers for OutisLi/DPA4-MatPES-v20260628