---
license: apache-2.0
datasets:
- HuggingFaceM4/the_cauldron
---
This model is a fine-tune of https://huggingface.co/vikhyatk/moondream2 on a subset of The Cauldron, intended to improve visual question answering and the reading of text in natural images.
The model is small enough to run on modest hardware, such as a Raspberry Pi.
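
As a rough sketch of what inference might look like, the snippet below loads the model with `transformers` and asks one question about an image. It assumes this fine-tune keeps the `encode_image` and `answer_question` helpers that the base moondream2 remote code exposed at the time; that is not confirmed here. `MODEL_ID` is a hypothetical placeholder, not this model's real repo id.

```python
# Hypothetical placeholder: replace with this model's actual Hub repo id.
MODEL_ID = "your-username/your-moondream-finetune"


def answer(image_path: str, question: str, model_id: str = MODEL_ID) -> str:
    """Answer one question about one image (sketch, see assumptions above)."""
    # Heavy imports are deferred so this module can be imported
    # on a machine that does not have them installed yet.
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # trust_remote_code is needed because moondream2 ships custom model code.
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    tokenizer = AutoTokenizer.from_pretrained(model_id)

    image = Image.open(image_path).convert("RGB")

    # Assumption: these two methods come from the base model's remote code
    # and are unchanged in this fine-tune.
    encoded = model.encode_image(image)
    return model.answer_question(encoded, question, tokenizer)
```

Calling `answer("sign.jpg", "What does the sign say?")` would then return the model's reply as a string, where `sign.jpg` is any local image. On a Raspberry Pi, expect the first call to be slow because the weights are loaded on every invocation; loading the model once and reusing it is the obvious refinement.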
More context on the training run can be found in the WandB logs linked below; a git repo is forthcoming.
https://wandb.ai/noahpunintended/moondream-ft-picorder?nw=nwusernoahpunintended
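
For readers who want to prepare similar training data, here is a minimal sketch of flattening Cauldron-style rows into (image, question, answer) triples. It assumes the published Cauldron row layout, where each row has an `images` list and a `texts` list of `user`/`assistant` turns. The helper name `iter_qa_pairs` is made up for illustration, and this is not necessarily how this model's training data was prepared.

```python
from typing import Any, Dict, Iterator, Optional, Tuple


def iter_qa_pairs(row: Dict[str, Any]) -> Iterator[Tuple[Optional[Any], str, str]]:
    """Yield (image, question, answer) for each complete turn in one row.

    Assumes the Cauldron layout: row["images"] is a list of images and
    row["texts"] is a list of {"user": ..., "assistant": ...} dicts.
    """
    images = row.get("images") or []
    # Only the first image is used; multi-image rows would need extra handling.
    image = images[0] if images else None

    for turn in row.get("texts") or []:
        question = (turn.get("user") or "").strip()
        answer = (turn.get("assistant") or "").strip()
        # Skip turns that are missing either side of the exchange.
        if question and answer:
            yield image, question, answer


# Tiny stand-in row; a real row would hold PIL images instead of a string.
example_row = {
    "images": ["<image>"],
    "texts": [
        {"user": "What does the sign say?", "assistant": "STOP"},
        {"user": "", "assistant": "ignored"},
    ],
}
pairs = list(iter_qa_pairs(example_row))
```

Rows for this helper could come from `datasets.load_dataset("HuggingFaceM4/the_cauldron", <config>)`, where the config name selects one of the Cauldron's sub-datasets; which configs were used for this fine-tune is not stated here.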