Edit model card

Model Card for Model ID

malhajar/Mixtral-8x7B-v0.1-turkish is a finetuned version of Mixtral-8x7B-v0.1 using SFT Training. This model can answer information in turkish language as it is finetuned on a turkish dataset specifically alpaca-gpt4-tr

Model Description

Prompt Template

### Instruction:

<prompt> (without the <>)

### Response:

How to Get Started with the Model

Use the code sample provided in the original post to interact with the model.

from transformers import AutoTokenizer,AutoModelForCausalLM
 
model_id = "malhajar/Mixtral-8x7B-v0.1-turkish"
model = AutoModelForCausalLM.from_pretrained(model_name_or_path,
                                             device_map="auto",
                                             torch_dtype=torch.float16,
                                             revision="main")

tokenizer = AutoTokenizer.from_pretrained(model_id)

question: "Türkiyenin en büyük şehir nedir?"
# For generating a response
prompt = f'''
### Instruction:  {question} ### Response:
'''
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
output = model.generate(inputs=input_ids,max_new_tokens=512,pad_token_id=tokenizer.eos_token_id,top_k=50, do_sample=True,repetition_penalty=1.3
        top_p=0.95,trust_remote_code=True,)
response = tokenizer.decode(output[0])

print(response)
Downloads last month
19
Safetensors
Model size
46.7B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for malhajar/Mixtral-8x7B-v0.1-turkish

Quantizations
1 model

Dataset used to train malhajar/Mixtral-8x7B-v0.1-turkish