Edit model card

This model will be fully swahili speaking despite being adapted from and english speaking model : All training applied will be in swahili or other dialects @

undergoing fine tuning stages as well as merging stages and retuning stages ! Searching for instruct datasets in swahili

this is a super fine tuned model .... but it may be behind other models: in the series : Hence this model is for applying lora adapter found on the hub and other created for other models : once applying a lora , set the model in train mode: model.train() And Train on a previoulsy trained dataset before merging the new lora : make sure the prvious dataset still is inline with the model : Often a lora can nudge the model the wrong way and loose some of its previous training as it applys weights on top of the odel which may net be consistant with your model especially if the lora was not trained for this model (but still for the same series (ie mistral))..

Downloads last month
392
Safetensors
Model size
7.24B params
Tensor type
FP16
·

Datasets used to train LeroyDyer/Mixtral_AI_SwahiliTron_7b

Space using LeroyDyer/Mixtral_AI_SwahiliTron_7b 1

Collection including LeroyDyer/Mixtral_AI_SwahiliTron_7b