π LLM Engineer's Handbook
Collection
Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook
β’
6 items
β’
Updated
TwinLlama-3.1-8B is a model created for the LLM Engineer's Handbook, trained on mlabonne/llmtwin.
It is designed to act as a digital twin, which is a clone of myself and my co-authors (Paul Iusztin and Alex Vesa), imitating our writing style and drawing knowledge from our articles.
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
8-bit
Base model
meta-llama/Llama-3.1-8B