Needle fine-tune export

This repo contains a browser-ready ONNX export of a locally fine-tuned Needle checkpoint.

Provenance

  • Base model: Cactus-Compute/needle
  • Fine-tuned checkpoint: needle_finetuned_20260608111244_50981_12_512_best.pkl
  • Fine-tuning data: UI tool-call dataset generated from the Needle demo UI

Files

File                      Description
encoder.onnx              Needle encoder exported from the fine-tuned checkpoint
decoder_step.onnx         One-step decoder with KV-cache I/O
needle.model              SentencePiece tokenizer
tokenizer-specials.json   Special token IDs used by the model

Usage

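Load the two ONNX graphs with onnxruntime-web and load needle.model with sentencepiece-js. Run the encoder once on the tokenized input, then call the decoder step repeatedly in a JS loop, feeding each step's KV-cache outputs back in as the next step's inputs.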
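The loop described above might look roughly like the sketch below. It is not the code used by the demo: the graph input/output names and the KV-cache layout are not documented here, so the two model calls are passed in as hypothetical callbacks (encode, decodeStep) that would wrap the onnxruntime-web sessions. Greedy (argmax) decoding, the bosId/eosId parameters, and the maxNewTokens limit are also assumptions; the start and stop IDs would presumably come from tokenizer-specials.json.

```typescript
// Minimal sketch of "encoder once, decoder step in a loop".
// All names below are hypothetical; adapt them to the actual graph I/O.

type Cache = unknown; // opaque KV cache, passed back unchanged to the next step

interface StepResult {
  logits: Float32Array; // scores over the vocabulary for the next token
  cache: Cache;         // updated KV cache returned by decoder_step.onnx
}

// Assumed wrapper around encoder.onnx: token IDs in, encoder state out.
type EncodeFn = (inputIds: number[]) => Promise<unknown>;

// Assumed wrapper around decoder_step.onnx: one token in, one set of logits out.
type DecodeStepFn = (
  encoded: unknown,
  tokenId: number,
  cache: Cache | null,
) => Promise<StepResult>;

// Index of the largest value (first one wins on ties).
function argmax(values: Float32Array): number {
  let best = 0;
  for (let i = 1; i < values.length; i++) {
    if (values[i] > values[best]) best = i;
  }
  return best;
}

async function greedyDecode(
  inputIds: number[],
  encode: EncodeFn,
  decodeStep: DecodeStepFn,
  bosId: number,
  eosId: number,
  maxNewTokens = 64,
): Promise<number[]> {
  const encoded = await encode(inputIds); // the encoder runs exactly once
  const output: number[] = [];
  let token = bosId;
  let cache: Cache | null = null; // no cache before the first step

  for (let i = 0; i < maxNewTokens; i++) {
    const step = await decodeStep(encoded, token, cache);
    token = argmax(step.logits);
    if (token === eosId) break; // stop token is not added to the output
    output.push(token);
    cache = step.cache; // feed this step's cache into the next step
  }
  return output;
}
```

In a real page, encode and decodeStep would build the input tensors, call run on the corresponding session, and read the logits and cache tensors from the result. The returned IDs would then be turned back into text with the SentencePiece tokenizer.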

Notes

This export follows the public porting guide from onnx-community/needle-onnx.
