metadata

library_name: transformers
license: other
datasets:
  - mlabonne/orpo-dpo-mix-40k
  - Open-Orca/SlimOrca-Dedup
  - jondurbin/airoboros-3.2
  - microsoft/orca-math-word-problems-200k
  - m-a-p/Code-Feedback
  - MaziyarPanahi/WizardLM_evol_instruct_V2_196k
base_model: meta-llama/Meta-Llama-3-8B

llama-3-neural-chat-v1-8b

Model Details

Model Description

I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO.

Developed by: Locutusque
Model type: Built with Meta Llama 3
Language(s) (NLP): Many?
License: Llama 3 license https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE

Uses

This model has great performance in writing and coding.

Training Data

Open-Orca/SlimOrca-Dedup
jondurbin/airoboros-3.2
microsoft/orca-math-word-problems-200k
m-a-p/Code-Feedback
MaziyarPanahi/WizardLM_evol_instruct_V2_196k
mlabonne/orpo-dpo-mix-40k

Direct Use

Conversational AI.

Evaluations

TBD.