
Main model

  • Developed by: LeroyDyer
  • License: apache-2.0
  • Finetuned from model: LeroyDyer/Mixtral_AI_CyberTron_Swahili_SFT

This model is intended to be fully Swahili-speaking despite being adapted from an English-speaking model. All training applied will be in Swahili or other dialects.

The model is still undergoing fine-tuning, merging, and retuning stages. Instruct datasets in Swahili are still being sought.

This is a heavily fine-tuned model, but it may be behind other models in the series. It is therefore intended as a base for applying LoRA adapters, both those found on the hub and those created for other models. After applying a LoRA, set the model to train mode with model.train() and train on a previously used dataset before merging the new LoRA, to make sure the previous dataset is still in line with the model. A LoRA can often nudge the model the wrong way and cause it to lose some of its previous training, because it applies weights on top of the model that may not be consistent with it, especially if the LoRA was not trained for this model (even if it was trained for the same series, i.e. Mistral).
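The adapter workflow described above could look roughly like the sketch below. It is not the author's own script: it assumes the transformers and peft libraries are installed, that the repository id shown on this model page is the base model, and that ADAPTER_ID is replaced with a real adapter (the value here is a hypothetical placeholder). The training step itself is left to whatever trainer was used for the earlier dataset.

```python
# Minimal sketch, assuming the `transformers` and `peft` libraries.
# ADAPTER_ID and the output directory are hypothetical placeholders.

BASE_MODEL_ID = "LeroyDyer/Mixtral_AI_CyberTron_Swahili_7b"
ADAPTER_ID = "your-username/your-mistral-lora"  # hypothetical, replace with a real adapter


def load_adapter_for_training(adapter_id=ADAPTER_ID, base_model_id=BASE_MODEL_ID):
    """Load the base model, attach a LoRA adapter, and switch to train mode."""
    # Heavy imports are kept inside the function so the module can be
    # imported without loading torch.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_model_id,
        torch_dtype=torch.float16,  # assumption: matches the FP16 weights on the hub
    )
    # is_trainable=True keeps the adapter weights unfrozen for further training.
    model = PeftModel.from_pretrained(base, adapter_id, is_trainable=True)
    model.train()  # train mode, as the card recommends
    return model, tokenizer


def merge_and_save(model, tokenizer, output_dir):
    """Fold the (retrained) adapter into the base weights and save the result."""
    merged = model.merge_and_unload()
    merged.save_pretrained(output_dir)
    tokenizer.save_pretrained(output_dir)
    return merged


if __name__ == "__main__":
    model, tokenizer = load_adapter_for_training()
    # Train here on a previously used dataset (for example with the trainer
    # used for the earlier runs) and check that its loss stays in line
    # before merging.
    merge_and_save(model, tokenizer, "merged-model")  # hypothetical output path
```

Retraining on the earlier dataset before calling merge_and_save is the safeguard the paragraph above describes: if the loss on that dataset rises sharply once the adapter is attached, the adapter is probably pulling the model away from its previous training.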

  • Format: Safetensors
  • Model size: 7.24B params
  • Tensor type: FP16