Main model

Developed by: LeroyDyer
License: apache-2.0
Finetuned from model : LeroyDyer/Mixtral_AI_CyberTron_Swahili_SFT This model will be fully swahili speaking despite being adapted from and english speaking model : All training applied will be in swahili or other dialects @

undergoing fine tuning stages as well as merging stages and retuning stages ! Searching for instruct datasets in swahili

this is a super fine tuned model .... but it may be behind other models: in the series : Hence this model is for applying lora adapter found on the hub and other created for other models : once applying a lora , set the model in train mode: model.train() And Train on a previoulsy trained dataset before merging the new lora : make sure the prvious dataset still is inline with the model : Often a lora can nudge the model the wrong way and loose some of its previous training as it applys weights on top of the odel which may net be consistant with your model especially if the lora was not trained for this model (but still for the same series (ie mistral))..

LeroyDyer
/

Mixtral_AI_CyberTron_Swahili_7b

Main model

Datasets used to train LeroyDyer/Mixtral_AI_CyberTron_Swahili_7b

Collections including LeroyDyer/Mixtral_AI_CyberTron_Swahili_7b

SwahiliBots

CyberTron_AI