Instructions to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="jaimef21/security-lora-qwen2.5-coder-3b-gguf", filename="security-lora.q4_k_m.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M # Run inference directly in the terminal: llama-cli -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M # Run inference directly in the terminal: llama-cli -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Use Docker
docker model run hf.co/jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Ollama:
ollama run hf.co/jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
- Unsloth Studio new
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for jaimef21/security-lora-qwen2.5-coder-3b-gguf to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for jaimef21/security-lora-qwen2.5-coder-3b-gguf to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for jaimef21/security-lora-qwen2.5-coder-3b-gguf to start chatting
- Pi new
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Docker Model Runner:
docker model run hf.co/jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
- Lemonade
How to use jaimef21/security-lora-qwen2.5-coder-3b-gguf with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull jaimef21/security-lora-qwen2.5-coder-3b-gguf:Q4_K_M
Run and chat with the model
lemonade run user.security-lora-qwen2.5-coder-3b-gguf-Q4_K_M
List all available models
lemonade list
security-lora-qwen2.5-coder-3b-gguf
GGUF quants of jaimef21/security-lora-qwen2.5-coder-3b-bf16, a Qwen2.5-Coder-3B-Instruct LoRA fine-tune for detecting malicious npm / PyPI supply-chain attack code.
Outputs a structured JSON verdict with cited line ranges. Tuned via DPO to suppress false positives on legitimate minified / telemetry / crypto code.
Training pipeline: https://github.com/jaimef21/security-lora
Files
| File | Quant | Size | Notes |
|---|---|---|---|
security-lora.q8_0.gguf |
Q8_0 | ~3.2 GB | High fidelity; indistinguishable from F16 for most use |
security-lora.q4_k_m.gguf |
Q4_K_M | ~1.9 GB | Runs comfortably on an 8 GB laptop or 4 GB CPU server |
Usage with llama.cpp
./llama-cli -m security-lora.q4_k_m.gguf \
-p "Analyze this code for malicious behaviour. Source: \`example.js\`.\n\n\`\`\`js\nconst x = require('child_process').exec(process.env.CMD);\n\`\`\`"
Usage with Ollama
Build a Modelfile locally:
FROM ./security-lora.q4_k_m.gguf
SYSTEM "You are a security analyst. Given a source code file, return a JSON verdict {verdict, confidence, categories, reasoning, iocs} for whether the code is malicious. verdict โ {malicious, suspicious, benign}. Always cite specific line ranges in iocs.lines when flagging behaviour. Be precise and concise."
PARAMETER temperature 0.1
PARAMETER num_ctx 8192
PARAMETER stop "<|im_end|>"
Then:
ollama create security-scan -f Modelfile
ollama run security-scan "Analyze this code..."
Output schema
{
"verdict": "malicious | suspicious | benign",
"confidence": 0.0,
"categories": ["env_exfil", "crypto_miner", "backdoor", "typosquat",
"install_hook_abuse", "obfuscated_eval", "c2_beacon",
"credential_theft", "protestware"],
"reasoning": "step-by-step taint flow / deobfuscation walkthrough",
"iocs": {"domains": [], "files": [], "lines": [], "env_vars": []}
}
Training pipeline
- CPT โ Continued pre-training on Datadog malicious-packages + Advisory DB + Semgrep registry.
- SFT โ Teacher-labeled verdicts (Qwen3-Coder-480B-A35B-FP8) on confirmed-malicious + popular-benign npm samples.
- DPO โ 300 benign-lookalike pairs to suppress false positives on minified bundles, telemetry SDKs, real crypto.
- Downloads last month
- 214
4-bit
8-bit
Model tree for jaimef21/security-lora-qwen2.5-coder-3b-gguf
Base model
Qwen/Qwen2.5-3B