Tags: Text Generation · Transformers · Safetensors · llama · 4-bit precision · AWQ · Inference Endpoints · conversational · text-generation-inference

Locutusque/llama-3-neural-chat-v1-8b AWQ


Model Summary

I fine-tuned Llama 3 8B using an approach similar to Intel's Neural Chat language model, with slightly modified data sources to make it stronger in coding, math, and writing. Training used both SFT and DPO.

The model performs particularly well on writing and coding tasks.
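
Below is a minimal inference sketch for the 4-bit AWQ checkpoint in this repository, assuming transformers with the autoawq backend installed; the prompt and generation settings are illustrative only, not prescribed by the model authors.

```python
# Minimal inference sketch for the AWQ 4-bit checkpoint; settings are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "solidrust/llama-3-neural-chat-v1-8b-AWQ"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# Loading AWQ weights through transformers requires the autoawq package;
# device_map="auto" places the quantized model on the available GPU.
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Write a Python function that reverses a string."},
]
# Apply the Llama 3 chat template and generate a reply.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```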

Training Data

  • Open-Orca/SlimOrca-Dedup
  • jondurbin/airoboros-3.2
  • microsoft/orca-math-word-problems-200k
  • m-a-p/Code-Feedback
  • MaziyarPanahi/WizardLM_evol_instruct_V2_196k
  • mlabonne/orpo-dpo-mix-40k
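
As a rough illustration of the DPO stage mentioned in the summary, the sketch below uses TRL's DPOTrainer with the mlabonne/orpo-dpo-mix-40k preference data listed above. The base checkpoint, hyperparameters, and data handling are assumptions, not the actual training recipe.

```python
# Hedged sketch of DPO fine-tuning with TRL; not the authors' exact recipe.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

base_model = "meta-llama/Meta-Llama-3-8B"  # assumed starting checkpoint
model = AutoModelForCausalLM.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token  # Llama 3 ships without a pad token

# Preference pairs (prompt / chosen / rejected); extra preprocessing may be
# needed depending on the TRL version and the dataset's column layout.
dataset = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train")

args = DPOConfig(
    output_dir="llama-3-neural-chat-dpo",
    beta=0.1,                        # weight of the DPO preference loss term
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
)

trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    tokenizer=tokenizer,  # newer TRL releases rename this to processing_class
)
trainer.train()
```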

Quantized Model Details

  • Format: 4-bit AWQ, Safetensors
  • Model size: 1.98B params (as reported for the packed quantized weights)
  • Tensor types: I32, FP16

Quantized from: Locutusque/llama-3-neural-chat-v1-8b
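
For context, a typical AutoAWQ quantization flow from that source checkpoint looks roughly like the sketch below; the 4-bit, group-size-128 GEMM settings are common AutoAWQ defaults and are assumptions rather than the documented configuration for this repository.

```python
# Rough AutoAWQ quantization sketch; config values are common defaults,
# not the settings documented for this repository.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

source_model = "Locutusque/llama-3-neural-chat-v1-8b"
quant_dir = "llama-3-neural-chat-v1-8b-AWQ"
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(source_model)
tokenizer = AutoTokenizer.from_pretrained(source_model)

# Calibrate on AutoAWQ's default calibration set, quantize to 4-bit AWQ,
# and save the packed weights in safetensors format.
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_dir)
tokenizer.save_pretrained(quant_dir)
```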

