candenizkocak
/

CoderLlama-3.1-8B-GGUF

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Model Description

This model is a fine-tuned version of unsloth/Meta-Llama-3.1-8B-bnb-4bit on cognitivecomputations/Code-290k-ShareGPT-Vicuna in order to answer questions related to programming better. Trained by the Google Colab Notebook provided by Unsloth with small modifications. Dataset format was converted from ShareGPT to Llama 3 in the training notebook. First 10k rows was used in training for demonstration purposes.

Developed by: Can Deniz Koçak
Finetuned from model: unsloth/Meta-Llama-3.1-8B-bnb-4bit

Fine-tuning Data

cognitivecomputations/Code-290k-ShareGPT-Vicuna

Training Procedure

Trained on a single A100 on Google Colab.

Developed by: candenizkocak
License: apache-2.0
Finetuned from model : unsloth/Meta-Llama-3.1-8B-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 26

GGUF

Model size

8.03B params

Architecture

llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for candenizkocak/CoderLlama-3.1-8B-GGUF

Base model

meta-llama/Llama-3.1-8B

Quantized

unsloth/Meta-Llama-3.1-8B-bnb-4bit

Quantized

(238)

this model