Instructions to use FahrenheitResearch/FR-Forge-1.7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use FahrenheitResearch/FR-Forge-1.7B with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("FahrenheitResearch/FR-Forge-1.7B") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Pi
How to use FahrenheitResearch/FR-Forge-1.7B with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "FahrenheitResearch/FR-Forge-1.7B" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use FahrenheitResearch/FR-Forge-1.7B with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default FahrenheitResearch/FR-Forge-1.7B
Run Hermes
hermes
- MLX LM
How to use FahrenheitResearch/FR-Forge-1.7B with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "FahrenheitResearch/FR-Forge-1.7B"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FahrenheitResearch/FR-Forge-1.7B", "messages": [ {"role": "user", "content": "Hello"} ] }'
FR-Forge-1.7B
A Thin Language Model for manufacturing. Grounded, specialized, and running on your laptop.
GROUNDED. SPECIALIZED. LOCAL.
Contents
- Overview
- How it works
- Specifications
- Quickstart
- Intended use
- Evaluation
- Limitations
- Training
- Citation
Overview
FR-Forge-1.7B is a Thin Language Model (TLM) for the manufacturing sector, by Fahrenheit Research. Sibling to FR-Lex (legal). It is a small, 4-bit, MLX model fine-tuned with LoRA adapters on a curated manufacturing instruction corpus, designed to run locally on Apple Silicon.
Where a Thin Language Model fits
specialization
▲
│ ● FR-Forge 1.7B
│ narrow · local · cheap
│
│ ● Frontier LLM
│ broad · hosted · costly
└──────────────────────────────────────────────▶ generality
How it works
Manufacturing question
│
▼
┌─────────────────────────────────────────────────────┐
│ FR-Forge 1.7B │
└─────────────────────────────────────────────────────┘
│
reasons across three pillars
│
├─ Ops & Maintenance SOPs · PM/PdM · OEE · troubleshooting
├─ Quality & Compliance ISO 9001 · IATF 16949 · FDA/GMP · CAPA · FMEA · SPC
└─ Supply Chain & Planning MRP/ERP · BOMs · inventory · demand planning
│
▼
Grounded, local answer (verify exact clauses against the controlling standard)
It is an assistant, not a certified authority. It is not a substitute for the controlling standard, safety sign-off, or regulatory advice.
Specifications
| Base | Qwen3-1.7B (4-bit, MLX) |
| Parameters | 1.7B · 4-bit |
| Method | LoRA adapters, fused |
| Runtime | MLX (Apple Silicon) |
| Languages | English |
| License | Apache-2.0 |
Quickstart
Runs locally with MLX on Apple Silicon. Apply a repetition penalty to avoid looping.
pip install -U mlx-lm
python3 -m mlx_lm generate \
--model FahrenheitResearch/FR-Forge-1.7B \
--repetition-penalty 1.15 \
--max-tokens 500 \
--prompt "How should I set safety stock for a long-lead component with variable demand?"
Recommended generation settings: repetition_penalty = 1.15 (raise to 1.3 if output repeats). A system prompt of "You are FR-Forge, a manufacturing domain assistant." matches training.
Intended use
Assisting manufacturing teams with everyday domain questions across three pillars:
FR-Forge covers
├─ Shop-floor ops & maintenance SOPs, work instructions, equipment manuals, PM/PdM, OEE, troubleshooting
├─ Quality & compliance ISO 9001, IATF 16949, ISO 13485, FDA/GMP, CAPA, FMEA, SPC, MSA, audits
└─ Supply chain & planning MRP/ERP, procurement, BOMs, inventory, demand planning, suppliers
Out of scope. Safety, regulatory, or compliance sign-off; anything that must be exact (part numbers, clause text, customer-specific requirements) without grounding it in your own documents.
Evaluation
A held-out set of prompts across the three pillars is scored with a deterministic keyword-rubric: each item defines groups of required terms, and a group passes if any synonym appears in the answer. The item score is the fraction of groups covered; pillar and overall scores are averages. Generation uses a 1.15 repetition penalty.
Held-out eval, keyword rubric (percent coverage)
Ops & Maintenance ███████████████████░ 92.9
Quality & Compliance ██████████████░░░░░░ 70.8
Supply Chain & Planning █████████████████░░░ 83.3
─────────────────────── ──────────────────── ────
Overall ████████████████░░░░ 82.3
| Pillar | Score |
|---|---|
| Ops & Maintenance | 92.9% |
| Quality & Compliance | 70.8% |
| Supply Chain & Planning | 83.3% |
| Overall | 82.3% |
This measures domain-term coverage, not eloquence or factual grading, and the held-out set is small, so treat results as a directional, reproducible yardstick rather than a precise grade. The untuned base can be scored with python3 scripts/evaluate.py --base-only, and the tuned model is reproduced with python3 scripts/evaluate.py --max-tokens 500.
Limitations
Not a certified authority. Outputs assist research and drafting only. They are not a substitute for the controlling standard, safety sign-off, or regulatory advice.
- English only (v1).
- Paraphrased, not verbatim. Trained on domain reasoning, not reproduced standards; always verify clause-level detail against the controlling standard.
- Small model. For facts that must be exact (part numbers, clause text, customer-specific requirements), ground it with retrieval over your own documents rather than relying on memory.
- Self-reported metric. Evaluation is an internal keyword-coverage score on a small held-out set.
Training
Domain instruction pairs ─┐
110 ops & maintenance │
100 quality & compliance ├─▶ LoRA fine-tune (MLX) ─▶ fuse adapters ─▶ FR-Forge 1.7B
51 supply chain & planning┘ iters 800 · lr 1e-5 · seq 512
Training data, 261 instruction pairs
Ops & Maintenance ████████████████ 110
Quality & Compliance ███████████████░ 100
Supply Chain & Planning ███████░░░░░░░░░ 51
Base, method, data, and hyperparameters
- Base model:
mlx-community/Qwen3-1.7B-4bit(Qwen3 architecture, 4-bit) - Method: LoRA adapters via
mlx-lm, fused into this standalone model - Data: 261 curated instruction pairs (110 ops & maintenance, 100 quality & compliance, 51 supply chain & planning). Sources: paraphrased domain reasoning plus model-assisted synthetic pairs with human review. No copyrighted standard text is reproduced verbatim; the data teaches structure and reasoning and cites clause numbers.
- Hyperparameters: iters 800, LoRA layers 8, batch size 1, max sequence length 512, learning rate 1e-5, gradient checkpointing on. Peak training memory ~2 GB.
Citation
Apache-2.0. Base model mlx-community/Qwen3-1.7B-4bit is Apache-2.0.
@software{fr_forge_2026,
title = {FR-Forge-1.7B: a thin language model for manufacturing},
author = {Fahrenheit Research},
year = {2026},
note = {Fine-tuned from Qwen3-1.7B (4-bit) with MLX/LoRA}
}
FAHRENHEIT RESEARCH
Thin Language Models for specialized domains.
- Downloads last month
- 40
4-bit
Model tree for FahrenheitResearch/FR-Forge-1.7B
Evaluation results
- Overall (rubric coverage) on FR-Forge held-out eval (3 pillars, keyword-rubric)self-reported82.300
- Ops & Maintenance on FR-Forge held-out eval (3 pillars, keyword-rubric)self-reported92.900
- Quality & Compliance on FR-Forge held-out eval (3 pillars, keyword-rubric)self-reported70.800
- Supply Chain & Planning on FR-Forge held-out eval (3 pillars, keyword-rubric)self-reported83.300