Instructions to use vthawfeek/mtdna-fm-pathogenicity with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use vthawfeek/mtdna-fm-pathogenicity with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
mtDNA-FM Pathogenicity Adapter (LoRA r=4)
LoRA adapter for pathogenic variant prediction (binary) on top of vthawfeek/mtdna-foundation-model.
Trained on ClinVar pathogenic mtDNA variants (positive class) vs gnomAD common variants AF > 0.01 (negative class).
Usage
from mtdna_fm.model.model import MtDNAForVariantPathogenicity, MtDNAModel
from peft import PeftModel
base = MtDNAModel.from_pretrained("vthawfeek/mtdna-foundation-model")
model = MtDNAForVariantPathogenicity(base)
model = PeftModel.from_pretrained(model, "vthawfeek/mtdna-fm-pathogenicity")
model.eval()
LoRA Configuration
- r = 4, lora_alpha = 8
- target_modules: query, key, value, dense
- lora_dropout = 0.1
- pos_weight = 2.5 in BCE loss (class imbalance correction)
Task
Binary pathogenicity prediction. Classifier uses the hidden state at the variant-position token (not the CLS token) — pathogenicity is a local property.
Limitations
Training data is limited (~2,000 ClinVar pathogenic + ~5,000 gnomAD negatives). Variants in underrepresented functional regions (tRNA, rRNA) may have lower accuracy.
- Downloads last month
- 23
Model tree for vthawfeek/mtdna-fm-pathogenicity
Base model
vthawfeek/mtdna-foundation-model