DuoNeural ML/AI Engineer 7B โ€” GGUF

GGUF quantizations of DuoNeural/ml-ai-engineer-7b, a Qwen2.5-7B-Instruct LoRA SFT for ML/AI engineering debugging and design review. See the base model card for training details, eval comparisons against the un-tuned base model, and known limitations.

Files

File Quant Size Notes
duoneural-ml-ai-engineer-7b-f16.gguf F16 15 GB Full precision, no quality loss
duoneural-ml-ai-engineer-7b-Q8_0.gguf Q8_0 7.6 GB Highest quality quantized option
duoneural-ml-ai-engineer-7b-Q5_K_M.gguf Q5_K_M 5.1 GB Good quality/size balance
duoneural-ml-ai-engineer-7b-Q4_K_M.gguf Q4_K_M 4.4 GB Smallest, fits comfortably on 8GB+ VRAM

Usage (llama.cpp)

llama-cli -m duoneural-ml-ai-engineer-7b-Q4_K_M.gguf -p "My loss goes to NaN at step ~340 only when I increase batch size. What's the first thing you'd check?" -n 512

Or with Ollama / LM Studio / any GGUF-compatible runtime โ€” point it at whichever quant fits your VRAM budget, largest one that fits.

About DuoNeural

DuoNeural is an open AI research lab operating at the intersection of human and artificial intelligence. We study post-training dynamics, mechanistic interpretability, temporal sequence learning, and quantum machine learning โ€” publishing everything under open access.

Our team is non-traditional by design: one human, two AIs, different substrates, shared curiosity. In our first 45 days we published 26 peer-deposited research papers, uploaded 69+ models and 6 datasets to HuggingFace, and ran experiments on everything from consumer GPUs to real quantum processing units. We believe the most interesting science happens when different kinds of minds work on the same problems together.

Research Publications

We've published 26+ open-access papers covering:

  • The Dynamical Horizon Principle (DHP) โ€” a universal learning constraint in recurrent architectures
  • RLHF truth suppression mechanisms and behavioral routing in large language models
  • Quantum DHP and the Quantum Parity Trap โ€” decoherence immunity in quantum circuits
  • CTM world models, temporal self-prediction, and sequence architecture comparisons
  • Mechanistic interpretability: crystallization layers, suppressor circuits, direction rotation

๐Ÿ“„ Full paper catalog: zenodo.org/communities/duoneural

Research Team

Member Role
Jesse Caldwell Founder, vision, hardware, direction
Archon Lab Director โ€” experiments, post-training, abliteration, quantum circuits
Aura Research AI โ€” literature synthesis, red-teaming, novel proposals
Synapse (Syn) Always-on research agent, signal monitoring
Kestrel Systems, infrastructure, web

Links

Platform Link
๐Ÿค— HuggingFace huggingface.co/DuoNeural
๐ŸŒ Website duoneural.com
๐Ÿ“š Zenodo Community zenodo.org/communities/duoneural
๐Ÿ’ป GitHub github.com/DuoNeural
๐Ÿฆ X / Twitter @DuoNeural
๐Ÿ“ง Email duoneural@proton.me
๐Ÿ“ฐ Newsletter duoneural.beehiiv.com
โ˜• Support buymeacoffee.com/duoneural

All research published open access, CC BY 4.0. If this model was useful to your work, consider citing the relevant DuoNeural paper from our Zenodo community.

Downloads last month
61
GGUF
Model size
8B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for DuoNeural/ml-ai-engineer-7b-GGUF

Base model

Qwen/Qwen2.5-7B
Quantized
(1)
this model