GLM-OCR CrispEmbed GGUF

GLM-Edge-V 2B vision-language model converted to GGUF for OCR with CrispEmbed.

Models

File Quant Size
glm-ocr-f16.gguf F16 ~3.8 GB
glm-ocr-q8_0.gguf Q8_0 ~2.0 GB
glm-ocr-q4_k.gguf Q4_K ~1.1 GB

Architecture

  • Base: GLM-Edge-V 2B (THUDM, Apache-2.0)
  • Vision: SigLIP vision encoder
  • LLM: GLM-4 decoder (2B params)
  • Task: Document OCR, scene text, handwriting

Usage

from crispembed import CrispOcrPipeline

ocr = CrispOcrPipeline(vlm_model="glm-ocr-q8_0.gguf")
text = ocr.recognize("document.png")

Original Model

THUDM/glm-edge-v-2b โ€” GLM-Edge-V 2B, CogViT + GLM-0.5B, 8 languages.

License

Apache-2.0

Downloads last month
242
GGUF
Model size
1B params
Architecture
glm_ocr
Hardware compatibility
Log In to add your hardware

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/glm-ocr-crispembed-GGUF

Quantized
(1)
this model