image

Nexus-Flash-9B-GGUF

This model is a fine-tuned version of unsloth/Qwen3.5-9B, optimized for agent-based reasoning tasks. It was trained using the Unsloth framework to achieve faster training speeds and memory efficiency.

📋 Model Details

🚀 Training & Optimization

This model was trained 2x faster using Unsloth combined with Hugging Face's TRL library. Unsloth allows for efficient fine-tuning of Large Language Models (LLMs) with significantly reduced VRAM usage and increased throughput.

Dataset Information

The model was fine-tuned on the Hermes Agent Reasoning Traces dataset. This dataset focuses on enhancing the model's ability to perform complex reasoning steps, particularly in agentic workflows, by providing detailed traces of thought processes and decision-making paths.

🎯 Intended Use & Capabilities

This model is designed for:

  • Agent Reasoning: Improved performance in tasks requiring multi-step logical deduction.
  • Complex Problem Solving: Better handling of intricate queries that require chain-of-thought processing.
  • General Text Generation: Maintains the strong general capabilities of the base Qwen3.5-9B model.

📄 License

This model is released under the Apache-2.0 license. Please refer to the base model's license and the dataset's license for any additional restrictions or requirements.

🙏 Acknowledgements

  • Unsloth: For providing the efficient fine-tuning framework.
  • Hugging Face TRL: For the training reinforcement library.
  • Lambda: For curating the Hermes Agent Reasoning Traces dataset.
  • Alibaba Cloud: For the original Qwen3.5 base model.
Downloads last month
105
GGUF
Model size
9B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for methil-group/nexus-flash-9B-GGUF

Finetuned
Qwen/Qwen3.5-9B
Quantized
(19)
this model

Collection including methil-group/nexus-flash-9B-GGUF