Instructions to use macmacmacmac/VibeThinker-3B-BugBounty-Triage-litert-lm with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- LiteRT-LM
How to use macmacmacmac/VibeThinker-3B-BugBounty-Triage-litert-lm with LiteRT-LM:
# LiteRT-LM runs on various platforms (Android, iOS, Windows, Linux, macOS, IoT, Web/WASM) # and supports many APIs (C++, Python, Kotlin, Swift, JavaScript, Flutter). # For platform-specific integration guides, please refer to the official developer website: # https://ai.google.dev/edge/litert-lm # To try LiteRT-LM, the easiest way is to use our CLI tool. # 1. Install the LiteRT-LM CLI tool: pip install litert-lm # 2. Download and run this model locally: # See: https://ai.google.dev/edge/litert-lm/cli litert-lm run \ --from-huggingface-repo=macmacmacmac/VibeThinker-3B-BugBounty-Triage-litert-lm \ model.litertlm \ --prompt="Write me a poem"
- LiteRT
How to use macmacmacmac/VibeThinker-3B-BugBounty-Triage-litert-lm with LiteRT:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
VibeThinker-3B · BugBounty-Triage · LiteRT-LM (fp16)
On-device LiteRT-LM (.litertlm) build of a
VibeThinker-3B model fine-tuned for
bug-bounty / vulnerability-disclosure triage (the vibebounty sidecar).
It reads a single researcher submission and returns a JSON verdict — one of 9 dispositions
(valid_impactful, valid_low, corroborated_surge, likely_duplicate, out_of_scope,
theoretical_no_poc, self_inflicted, accepted_risk, slop) with severity, confidence, and a rationale.
Files
| file | precision | size | signatures | context |
|---|---|---|---|---|
vibethinker3b-bbtriage-v2_fp16_ekv8192.litertlm |
fp16 | ~6.4 GB | prefill + decode | 8192 |
Eval (held-out, via the production harness path)
| metric | this model | prior tune | heuristic baseline |
|---|---|---|---|
| disposition accuracy | 95% | 35% | 56% |
| accept / reject | 100% | 70% | 97% |
| severity exact / ≤1 | 95% / 100% | 25% / 68% | 32% / 71% |
| parse failures | 0 | many | — |
n = 20 held-out reports (subset; CPU eval). Strong signal, not the full 300-set number.
Provenance & conversion
- Base:
WeiboAI/VibeThinker-3B(Qwen2.5-3B architecture, reasoning model) - Fine-tune: LoRA (rank 16, all 36 layers, q/k/v/o + MLP), trained via MLX on ~18k labeled disclosure outcomes, then fused into the base weights
- Convert:
litert-torch→ multi-signature.tflite(fp16), packaged withlitert-lm-builder—qwen2p5metadata, 8192 KV cache, HF tokenizer, chat template, stop tokens151645/151643 - Why fp16, not int8: this aggressive (
scale 20) merge has weight outliers that int8 weight-quantization clips into incoherent output; fp16 preserves the fine-tune. (A separately-validated int8 build may be offered if it survives quantization.)
Usage
litert-lm import vibethinker3b-bbtriage-v2_fp16_ekv8192.litertlm bbtriage
litert-lm serve --port 8092 # OpenAI-compatible /v1/chat/completions
Then point any OpenAI client — or the vibebounty harness (MODEL_BASE_URL) — at it.
License
MIT, inherited from the base model WeiboAI/VibeThinker-3B.
- Downloads last month
- 4