Kalbe Farma - Visual Question Answering (VQA) for Medical Imaging

Overview

The project addresses the challenge of accurate and efficient medical imaging analysis in healthcare, aiming to reduce human error and workload for radiologists. The proposed solution involves developing advanced AI models for Visual Question Answering (VQA) to assist healthcare professionals in analyzing medical images quickly and accurately. These models will be integrated into a user-friendly web application, providing a practical tool for real-world healthcare settings.

Dataset

The model is trained using the Hugging face.

Reference: ScienceDirect

Model Architecture

The model uses a Parameterized Hypercomplex Shared Encoder network (PHYSEnet).

Model Architecture

Reference: ScienceDirect

Demo

Please select the example below or upload 4 pairs of mammography exam results.

Usage


cd src

Run the following command below:
python app.py

        

Check out the configuration reference at Hugging Face