GGUF conversion of zeroentropy/zerank-2-reranker. The conversion was done with llama.cpp's convert_hf_to_gguf.py, with lm_head untied from embed_tokens.
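
For readers who want to reproduce a similar conversion, here is a minimal sketch that only assembles the command line for llama.cpp's convert_hf_to_gguf.py and prints it. The local checkpoint directory and the output file name are hypothetical, and the q8_0 output type is an assumption based on the 8-bit file listed for this repository. The card does not say how lm_head was untied from embed_tokens, so that step is not covered here.

```python
import shlex
from pathlib import Path


def build_convert_command(model_dir, outfile, outtype="q8_0",
                          script="convert_hf_to_gguf.py"):
    """Return the argv list for a llama.cpp Hugging Face to GGUF conversion.

    model_dir: local directory with the downloaded checkpoint (hypothetical path).
    outfile:   destination .gguf file (hypothetical name).
    outtype:   assumed to be q8_0, matching the 8-bit file listed on the card.
    script:    path to convert_hf_to_gguf.py inside a llama.cpp checkout.
    """
    return [
        "python",
        str(script),
        str(Path(model_dir)),
        "--outfile",
        str(Path(outfile)),
        "--outtype",
        outtype,
    ]


# Build the command without running it; nothing is converted here.
cmd = build_convert_command("zerank-2-reranker", "zerank-2-reranker-q8_0.gguf")
print(shlex.join(cmd))
```

The function returns a list rather than a single string so it can be passed directly to subprocess.run without shell quoting issues once the paths point at real files.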

Format: GGUF
Model size: 4B params
Architecture: qwen3
Quantization: 8-bit

Model tree for 1N4148/zerank-2-reranker-GGUF

Finetuned: Qwen/Qwen3-4B
Quantized (9): this model