👥 TwinLlama-3.1-8B

TwinLlama-3.1-8B is a model created for the LLM Engineer's Handbook, trained on the mlabonne/llmtwin dataset.

It is designed to act as a digital twin: a clone of myself and my co-authors (Paul Iusztin and Alex Vesa) that imitates our writing style and draws knowledge from our articles.

This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Format: GGUF
Model size: 8.03B params
Architecture: llama
Hardware compatibility: 8-bit
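As a rough sanity check on the 8.03B parameter count and the 8-bit entry listed above, the size of the weights can be estimated with simple arithmetic. This is a minimal sketch, not a measurement of the actual file. It assumes exactly 8 bits per weight and ignores quantization metadata, the KV cache, and runtime overhead. The helper name `weight_footprint_gb` is hypothetical.

```python
def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in gigabytes (10**9 bytes).

    Assumption: every parameter is stored with the same number of bits and
    nothing else (scales, metadata, KV cache) is counted.
    """
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 8.03e9  # parameter count listed on this card

# Compare the 8-bit estimate with an unquantized 16-bit baseline.
print(f"8-bit:  {weight_footprint_gb(N_PARAMS, 8):.2f} GB")
print(f"16-bit: {weight_footprint_gb(N_PARAMS, 16):.2f} GB")
```

Under these assumptions the 8-bit weights need roughly half the space of a 16-bit copy. The real GGUF file will be somewhat larger because of per-block quantization data and file metadata.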

Model tree for mlabonne/TwinLlama-3.1-8B-GGUF

Base model: meta-llama/Llama-3.1-8B
Quantized (212): this model

Dataset used to train mlabonne/TwinLlama-3.1-8B-GGUF: mlabonne/llmtwin

Collection including mlabonne/TwinLlama-3.1-8B-GGUF

📙 LLM Engineer's Handbook: Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook (6 items)