You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

OLDA-Bench

An Interpretable AI System for Oral Leukoplakia Progression: From Early Screening to Lesion Delineation


Overview

Oral leukoplakia is one of the most common oral potentially malignant disorders (OPMDs) and represents a critical precursor to oral cancer. However, current diagnostic workflows largely rely on invasive biopsy procedures and subjective clinical assessment, limiting their suitability for large-scale screening and longitudinal monitoring.

To address these challenges, we introduce OLDA-Bench (Oral Leukoplakia Diagnosis and Analysis Benchmark), the first benchmark specifically designed for oral leukoplakia progression recognition and interpretable lesion assessment

Dataset Access

The dataset is available through the Hugging Face Dataset Hub:

https://huggingface.co/datasets/gy-chen/OLDA-Bench-Dataset

Access requests are reviewed manually and granted for academic and non-commercial research purposes only.


📦 Dataset Structure

Images identified as Leukoplakia or Leukoplakia with Cancer in the first stage are passed to the segmentation network. This stage uses a dataset in the standard COCO format, structured as follows:

./root_data/
│
├── train/
│ ├── xxx.jpg
│ ├── ...
├── val/
│ ├── xxx.jpeg
│ ├── ...
├── annotations
│ ├── train.json
│ ├── val.json

The instance segmentation task includes two categories:

  • Leukoplakia
  • Leukoplakia Cancer

The .json annotation files provide pixel-level segmentation masks and category labels for each lesion instance.


📊 Dataset Split

  • Training set: 389 images
  • Validation set: 44 images

🧪 Benchmark with MMdetection3.x & Detection2

🔹 Training

# Single-GPU training
python tools/train.py\
  work_dirs_mask2former_swin_b/mask2former_swin-s-p4-w7-224_8xb2-lsj-50e_coco.py

# Multi-GPU training
CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 ./tools/train.py\
  work_dirs_mask2former_swin_b/mask2former_swin-s-p4-w7-224_8xb2-lsj-50e_coco.py\
  --launcher pytorch

You can replace config with any supported architecture name from mmdetection3.x.

🔹 Evaluation

After training, evaluate a model checkpoint on the test set:

# Single-GPU evaluation
python tools/test.py\
  work_dirs_sparseinsts/sparseinst_r50_iam_8xb8-ms-270k_coco.py\
  work_dirs_sparseinsts/best_coco_segm_mAP_iter_8500.pth

# Multi-GPU evaluation
CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 tools/test.py \
    work_dirs_sparseinsts/sparseinst_r50_iam_8xb8-ms-270k_coco.py \
    work_dirs_sparseinsts/best_coco_segm_mAP_iter_8500.pth \
    --launcher pytorch 

🔹Additional Info

The OLDA-Bench is based on our previous work (environment code base) published in IEEE JBHI CDTM and OLPR.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support