QuantFactory Banner

QuantFactory/Llama-3.2-3B-Agent007-GGUF

This is quantized version of EpistemeAI/Llama-3.2-3B-Agent007 created using llama.cpp

Original Model Card

Uploaded model

  • Developed by: EpistemeAI
  • License: apache-2.0
  • Finetuned from model : unsloth/llama-3.2-3b-instruct-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
205
GGUF
Model size
3.21B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .