Instructions to use Willie999/Anidane with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Willie999/Anidane with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("unsloth/medgemma-1.5-4b-it") model = PeftModel.from_pretrained(base_model, "Willie999/Anidane") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- Unsloth Studio
How to use Willie999/Anidane with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Willie999/Anidane to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Willie999/Anidane to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Willie999/Anidane to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="Willie999/Anidane", max_seq_length=2048, )
Anidane (MedGemma 1.5 4B Fine-Tuned for Ophthalmic Analysis)
Anidane is a fine-tuned, multimodal vision-language model optimized for the analysis and diagnostic assessment of ophthalmic imagery, specifically fundus and retinal photography. Built on top of the google/medgemma-1.5-4b-it architecture and accelerated using Unsloth, Anidane bridges the gap in clinical accessibility by enabling parameter-efficient interpretation of localized eye pathologies and structural abnormalities.
Model Details
- Developed by: WILLIAMS OPOKU & SAMUEL AGYEMANG
- Model Type: Multimodal Vision-Language Model (Image-Text to Text)
- Base Model:
google/medgemma-1.5-4b-it(Gemma 3 / MedSigLIP Core) - Fine-Tuning Method: QLoRA (4-bit quantized) via Unsloth
- Primary Domain: Ophthalmology, Retinal Image Diagnostic Reporting
Intended Uses & Limitations
Intended Use
- Clinical Assistant Tool: Providing detailed descriptive impressions of fundus abnormalities (e.g., microaneurysms, vascular alterations, optic disc changes).
- Research & Education: Analyzing pathological trends in public ophthalmic imagery datasets, with an emphasis on mitigating geographic screening biases in underrepresented cohorts.
Limitations & Out of Scope
- Not a Standalone Diagnostic Tool: Anidane is designed strictly as an assistive reporting tool and must not be used for definitive medical diagnoses without certified practitioner oversight.
- Data Demographics: While optimized to interpret global clinical profiles, the model's accuracy is closely tied to the imaging quality and distribution found within the underlying train sets.
Training Details
Training Dataset
- Dataset Source:
smrndmdude/eye-diseases-dataset - Modality: High-resolution eye/fundus photography paired with localized clinical diagnostic captions.
Hyperparameters & Optimization
The model was fine-tuned using the following configurations:
- Quantization: 4-bit NormalFloat4 (NF4)
- LoRA Rank ($r$): 16
- LoRA Alpha ($\alpha$): 16
- Optimizer:
adamw_8bit - Learning Rate: $2 \times 10^{-4}$ (Linear decay)
- Gradient Accumulation Steps: 4
- Batch Size: 2 per device
- Total Steps: 59
Training Loss Curve Performance
During fine-tuning, the model displayed highly stable convergence characteristics, with training loss scaling down seamlessly as follows:
- Initial Phase (Step 1): 6.061622
- Mid Phase (Step 30): 0.171660
- Final Convergence Phase (Step 59): 0.080335
How to Use the Model
You can load and query Anidane using the Hugging Face transformers library or via unsloth for accelerated inference.
import torch
from PIL import Image
from transformers import AutoProcessor
from unsloth import FastVisionModel # Import FastVisionModel
model_id = "Willie999/Anidane"
processor = AutoProcessor.from_pretrained(model_id)
# Use FastVisionModel for loading the model, consistent with training
# FastVisionModel.from_pretrained returns a tuple (model, processor), so we unpack it.
model, _ = FastVisionModel.from_pretrained(
model_name = model_id, # Use model_name for FastVisionModel
torch_dtype=torch.bfloat16,
device_map="auto"
)
# Load a target retinal image
image = Image.open("/content/red-eye-disease.jpg").convert("RGB")
messages = [
{
"role": "user",
"content": [
{"type": "image"},
{"type": "text", "text": "Analyze this fundus/eye photograph. Provide a clear diagnostic assessment focusing on localized anomalies."}
]
}
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device)
with torch.no_grad():
generated_ids = model.generate(
input_ids=inputs["input_ids"],
pixel_values=inputs["pixel_values"], # Explicitly pass pixel_values
max_new_tokens=256,
do_sample=False
)
response = processor.batch_decode(
generated_ids[:, inputs["input_ids"].shape[1]:],
skip_special_tokens=True
)[0]
print("Anidane Analysis:", response)
- Downloads last month
- 151
Model tree for Willie999/Anidane
Base model
google/medgemma-1.5-4b-it