mtDNA-FM Pathogenicity Adapter (LoRA r=4)

LoRA adapter for pathogenic variant prediction (binary) on top of vthawfeek/mtdna-foundation-model.

Trained on ClinVar pathogenic mtDNA variants (positive class) vs gnomAD common variants AF > 0.01 (negative class).

Usage

from mtdna_fm.model.model import MtDNAForVariantPathogenicity, MtDNAModel
from peft import PeftModel

base = MtDNAModel.from_pretrained("vthawfeek/mtdna-foundation-model")
model = MtDNAForVariantPathogenicity(base)
model = PeftModel.from_pretrained(model, "vthawfeek/mtdna-fm-pathogenicity")
model.eval()

LoRA Configuration

  • r = 4, lora_alpha = 8
  • target_modules: query, key, value, dense
  • lora_dropout = 0.1
  • pos_weight = 2.5 in BCE loss (class imbalance correction)

Task

Binary pathogenicity prediction. Classifier uses the hidden state at the variant-position token (not the CLS token) — pathogenicity is a local property.

Limitations

Training data is limited (~2,000 ClinVar pathogenic + ~5,000 gnomAD negatives). Variants in underrepresented functional regions (tRNA, rRNA) may have lower accuracy.

Downloads last month
23
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for vthawfeek/mtdna-fm-pathogenicity

Adapter
(2)
this model