cstr/pix2struct-GGUF

document-understanding

Pix2Struct Base (GGUF)

A variable-resolution image-to-text model for documents, charts, and tables. 282M parameters (12-layer encoder + 12-layer decoder), licensed under Apache-2.0.
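"Variable-resolution" means the input is not squashed to a fixed square: the image is cut into a grid of patches whose shape follows the image's aspect ratio, subject to a budget on the total number of patches. The sketch below illustrates one way such a grid can be chosen. The function name `patch_grid`, the 16-pixel patch size, and the 2048-patch budget are assumptions for illustration, not values stated in this card.

```python
import math


def patch_grid(image_height, image_width, patch_size=16, max_patches=2048):
    """Pick (rows, cols) of patches that keep the aspect ratio within a budget.

    patch_size and max_patches are illustrative defaults (assumptions).
    """
    # Scale factor such that rows * cols is at most max_patches
    # once the image is resized to a whole number of patches.
    scale = math.sqrt(
        max_patches * (patch_size / image_height) * (patch_size / image_width)
    )
    rows = max(min(math.floor(scale * image_height / patch_size), max_patches), 1)
    cols = max(min(math.floor(scale * image_width / patch_size), max_patches), 1)
    return rows, cols


if __name__ == "__main__":
    # A landscape page, a portrait page, and a wide chart strip.
    for height, width in [(768, 1024), (2200, 1700), (300, 1600)]:
        rows, cols = patch_grid(height, width)
        print(f"{height}x{width} -> {rows} rows x {cols} cols ({rows * cols} patches)")
```

A wide image gets more columns than rows and a tall image the reverse, so text in long tables or charts is not distorted by forcing a square input.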

Encoder parity: cosine similarity of 1.000000 against the Hugging Face implementation. Source model: google/pix2struct-base.
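A parity figure of this kind is typically obtained by running the same input through both implementations, flattening the encoder outputs, and comparing them with cosine similarity. The following is a minimal sketch of that comparison only. The vectors `reference` and `converted` are made-up placeholder values, not real model outputs, and the helper name is hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length sequences of floats."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Placeholder values standing in for flattened encoder outputs (assumption).
    reference = [0.12, -0.53, 0.98, 0.07]
    converted = [0.12, -0.53, 0.98, 0.07]
    print(f"cos={cosine_similarity(reference, converted):.6f}")
```

Cosine similarity measures direction only, so a value of 1.000000 indicates the outputs point the same way to six decimal places; a separate check such as maximum absolute difference would be needed to confirm matching magnitudes.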

Format: GGUF

Model size: 0.3B params

Architecture: pix2struct

Available precision: 32-bit