linkd-dsl-qwen3-4b

Merged full-weights release (Qwen/Qwen3-4B-Instruct-2507 + the linkd-search DSL LoRA, ericmao/linkd-dsl-qwen3-4b-lora). Turns free-form people-search queries into MongoDB find filters for the Berkeley.profilematch collection.

Held-out eval (200 queries, result-grounded gpt-oss-120b judge): reward 0.1507 vs gpt-5.5-medium 0.1043, 99% valid DSL, 0.76 s p50 generation on an RTX 4090.

Usage

vllm serve ericmao/linkd-dsl-qwen3-4b \
  --max-model-len 2048 \
  --speculative-config '{"method":"ngram","num_speculative_tokens":8,"prompt_lookup_max":4,"prompt_lookup_min":2}'

Call with the exact production system prompt (linkd-search repo, slm/common.py:SYSTEM_PROMPT), temperature=0; the completion is a raw JSON Mongo filter for collection.find(filter).limit(20).

Downloads last month
1
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ericmao/linkd-dsl-qwen3-4b

Finetuned
(1733)
this model