GLM-OCR CrispEmbed GGUF
GLM-Edge-V 2B vision-language model converted to GGUF for OCR with CrispEmbed.
Models
| File | Quant | Size |
|---|---|---|
glm-ocr-f16.gguf |
F16 | ~3.8 GB |
glm-ocr-q8_0.gguf |
Q8_0 | ~2.0 GB |
glm-ocr-q4_k.gguf |
Q4_K | ~1.1 GB |
Architecture
- Base: GLM-Edge-V 2B (THUDM, Apache-2.0)
- Vision: SigLIP vision encoder
- LLM: GLM-4 decoder (2B params)
- Task: Document OCR, scene text, handwriting
Usage
from crispembed import CrispOcrPipeline
ocr = CrispOcrPipeline(vlm_model="glm-ocr-q8_0.gguf")
text = ocr.recognize("document.png")
Original Model
THUDM/glm-edge-v-2b โ GLM-Edge-V 2B, CogViT + GLM-0.5B, 8 languages.
License
Apache-2.0
- Downloads last month
- 242
Hardware compatibility
Log In to add your hardware
8-bit
16-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for cstr/glm-ocr-crispembed-GGUF
Base model
zai-org/glm-edge-v-2b