Model Name: Llama 3 Al_Dente_v1_8b

Llama 3 Al_Dente_v1_8b is trained on various supervised fine-tuning (SFT) datasets.


Passionate about Generative AI? I help companies privately train and deploy custom LLMs/MLLMs affordably. For startups, I can even assist with securing GPU grants to get you started. Let's chat!

https://www.linkedin.com/in/pankajam Looking forward to connecting!


NOTICE

By providing proper credit and attribution, you are granted permission to use this model as a foundational base for further full fine-tuning, DPO, PPO, or ORPO tuning, and any kind of merge. I actively encourage users to customize and enhance the model according to their specific needs, as this version is designed to be a comprehensive general model. Dive in and innovate!

Evaluation

Coming soon.

Example Usage

The model uses the ChatML prompt format:

<|im_start|>system
You are Al Dente, a helpful AI assistant.<|im_end|>
<|im_start|>user
Hello Al Dente, what can you do for me?<|im_end|>
<|im_start|>assistant
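
To make the layout above concrete, here is a minimal sketch that assembles the same ChatML string by hand. The helper name `format_chatml` and its `add_generation_prompt` parameter are hypothetical and exist only for illustration; in practice the chat template shipped with the model's tokenizer should be treated as authoritative.

```python
from typing import Dict, List


def format_chatml(messages: List[Dict[str, str]], add_generation_prompt: bool = True) -> str:
    """Render role/content messages as a ChatML prompt (illustrative helper, not part of the model repo)."""
    parts = []
    for message in messages:
        # Each turn is wrapped as <|im_start|>{role}\n{content}<|im_end|>
        parts.append(f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n")
    if add_generation_prompt:
        # An open assistant header tells the model it should answer next
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You are Al Dente, a helpful AI assistant."},
    {"role": "user", "content": "Hello Al Dente, what can you do for me?"},
]

prompt = format_chatml(messages)
print(prompt)
```

Printing `prompt` should reproduce the five lines shown above, ending with the open assistant header.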

The following code example shows how to use this model:

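from transformers import AutoModelForCausalLM, AutoTokenizer

model_slug = "pankajmathur/Al_Dente_v1_8b"
# AutoModelForCausalLM loads the language-modeling head that generate() requires
model = AutoModelForCausalLM.from_pretrained(model_slug)
tokenizer = AutoTokenizer.from_pretrained(model_slug)

messages = [
    {"role": "system", "content": "You are Al Dente, a helpful AI assistant."},
    {"role": "user", "content": "Hello Al Dente, what can you do for me?"}
]

# Append the assistant header and return a dict so it can be unpacked into generate()
gen_input = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_dict=True, return_tensors="pt"
)
# max_new_tokens=256 is an illustrative limit; adjust as needed
output = model.generate(**gen_input, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))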

This model is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.

Quants

GGUF: Coming Soon

AWQ: Coming Soon

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 17.12 |
| IFEval (0-Shot) | 36.94 |
| BBH (3-Shot) | 27.25 |
| MATH Lvl 5 (4-Shot) | 3.02 |
| GPQA (0-shot) | 6.60 |
| MuSR (0-shot) | 8.27 |
| MMLU-PRO (5-shot) | 20.67 |
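
As a quick sanity check on the table, the reported Avg. appears consistent with an unweighted mean of the six benchmark scores. That interpretation is an assumption inferred from the numbers themselves, not something this card states. A minimal sketch:

```python
# Scores copied from the results table above
scores = {
    "IFEval (0-Shot)": 36.94,
    "BBH (3-Shot)": 27.25,
    "MATH Lvl 5 (4-Shot)": 3.02,
    "GPQA (0-shot)": 6.60,
    "MuSR (0-shot)": 8.27,
    "MMLU-PRO (5-shot)": 20.67,
}

# Assumption: "Avg." is the plain arithmetic mean of the six scores
average = sum(scores.values()) / len(scores)
print(f"Unweighted mean: {average:.3f}")
```

The computed mean lands within rounding distance of the 17.12 listed in the Avg. row.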

Model size: 8.03B params (Safetensors), tensor type BF16