Image-Text-to-Text
MLX
Safetensors
PaddleOCR
English
Chinese
multilingual
paddleocr_vl
ERNIE4.5
PaddlePaddle
image-to-text
ocr
conversational
custom_code
4-bit precision
Instructions to use huggingfinger0/PaddleOCR-VL-1.5-4bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use huggingfinger0/PaddleOCR-VL-1.5-4bit with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("huggingfinger0/PaddleOCR-VL-1.5-4bit") config = load_config("huggingfinger0/PaddleOCR-VL-1.5-4bit") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - PaddleOCR
How to use huggingfinger0/PaddleOCR-VL-1.5-4bit with PaddleOCR:
# Please refer to the document for information on how to use the model. # https://paddlepaddle.github.io/PaddleOCR/latest/en/version3.x/module_usage/module_overview.html
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
PaddleOCR-VL-1.5 — 4-bit (MLX)
A 4-bit, group-size-64 MLX quantization of
PaddlePaddle/PaddleOCR-VL-1.5, for
on-device OCR on Apple Silicon. The newer
PaddleOCR-VL-1.6 is an architecture-identical
drop-in successor — this 1.5 quant is kept for reference.
| Base model | PaddlePaddle/PaddleOCR-VL-1.5 |
| Quantization | 4-bit, group size 64, affine |
| Format | MLX safetensors |
| Size | ~0.7 GB |
License & attribution
Apache-2.0, inherited from the base model. All credit for the model goes to the PaddlePaddle / PaddleOCR team — this repository only provides an MLX-quantized copy.
- Downloads last month
- 49
Model size
0.3B params
Tensor type
BF16
·
U32 ·
Hardware compatibility
Log In to add your hardware
4-bit
Model tree for huggingfinger0/PaddleOCR-VL-1.5-4bit
Base model
baidu/ERNIE-4.5-0.3B-Paddle Finetuned
PaddlePaddle/PaddleOCR-VL-1.5