GGUF
conversational

hailo-llama-3.2 Model Card

Description

This is an experimental GGUF model for Hailo hailo-llama-3.2 in llama.cpp.

Usage

You can use this model with the following branch: hailo.

Clone and build:

$ git clone -b hailo https://github.com/danbev/llama.cpp.git && cd llama.cpp
$ cmake -B build -S . -DGGML_HAILO=ON -DGGML_NATIVE=ON
$ cmake --build build --target llama-cli -j 12

Run:

$ ./build/bin/llama-cli -hf danbev/hailo-llama-3.2-GGUF
Downloads last month
241
GGUF
Model size
1B params
Architecture
hailo
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including danbev/hailo-llama-3.2-GGUF