antflydb/gliner2-base-v1

Antfly GLiNER2 base bundle with a ClipClap-style flat variants layout.

The gguf-Q4_K variant is a split native bundle:

  • gliner2-encoder.Q4_K.gguf
  • gliner2-head.Q4_K.gguf

The repo also includes:

  • model.onnx, copied from lion-ai/gliner2-base-v1-onnx
  • model.safetensors, copied from fastino/gliner2-base-v1
  • tokenizer and GLiNER2 sidecars from the upstream/base bundle

Antfly Inference reads antfly_inference_variants.json and resolves complete encoder/head pairs as gliner2_split_bundle/v1 at load time.

How to Use with Antfly Inference

antfly inference pull antflydb/gliner2-base-v1:gguf:Q4_K
antfly inference run

Advertised Antfly Inference tasks:

  • recognize
  • extract
  • classify

Relation extraction is advertised as a capability and is driven through the extraction schema.

Downloads last month
197
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for antflydb/gliner2-base-v1

Quantized
(5)
this model