metadata
language:
  - en
  - sw
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - trl
base_model: LeroyDyer/Mixtral_AI_MiniTron_II
datasets:
  - iamshnoo/alpaca-cleaned-swahili
library_name: transformers

Uploaded model

  • Developed by: LeroyDyer
  • License: apache-2.0
  • Finetuned from model: LeroyDyer/Mixtral_AI_MiniTron_II

This is a smaller model that is easier and faster to fine-tune. It was created from a fresh, untrained model and has so far been trained only on Swahili data. It is still training!

It will also run and train on a laptop with no problem! When training on plain text corpora, keep the context length low, because a long context forces the GPU to consume more memory, so use small articles only. Later, after intensive training, the context can be extended again. This model will be fully Swahili-speaking despite being adapted from an English-speaking model: all training applied will be in Swahili or other dialects.
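One way to keep the context low is to cut longer articles into short samples before training. The sketch below is only an illustration: the helper name `split_into_short_samples` and the default of 128 words are assumptions, not something this model card prescribes, and a word count is only a rough stand-in for the tokenizer's real token count.

```python
from typing import List


def split_into_short_samples(text: str, max_words: int = 128) -> List[str]:
    """Split a long article into pieces of at most `max_words` words.

    Hypothetical helper: word count is only a rough proxy for token count,
    so the real sequence length still depends on the tokenizer.
    """
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    # Step through the word list in windows of `max_words` words.
    return [
        " ".join(words[start:start + max_words])
        for start in range(0, len(words), max_words)
    ]


if __name__ == "__main__":
    article = "Habari ya asubuhi rafiki yangu mpendwa"
    for sample in split_into_short_samples(article, max_words=4):
        print(sample)
```

Each short sample can then be tokenized and used as its own training example, which keeps GPU memory use per step small.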

The model is undergoing fine-tuning stages as well as merging and retuning stages! I am currently searching for instruct datasets in Swahili.

This is a heavily fine-tuned model, but it may be behind other models in the series. It is therefore intended for applying LoRA adapters, both those found on the hub and those created for other models. After applying a LoRA, set the model to train mode with model.train() and train on a previously used dataset before merging the new LoRA, to make sure the model is still in line with that earlier dataset. A LoRA can often nudge the model the wrong way and make it lose some of its previous training, because it applies weights on top of the model that may not be consistent with it, especially if the LoRA was not trained for this model (even if it was trained for the same series, i.e. Mistral).
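The apply, retrain, then merge workflow described above could look roughly like this with the `peft` library. This is a minimal sketch, not the author's exact procedure: the function names and the adapter id in the usage comment are placeholders, and the training step itself is left to whatever trainer you already use.

```python
def load_adapter_for_retuning(base_model_id: str, adapter_id: str):
    """Attach a LoRA adapter to a base model and leave it ready for training."""
    # Imported lazily so this file can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base = AutoModelForCausalLM.from_pretrained(base_model_id)
    # is_trainable=True keeps the LoRA weights unfrozen for further training.
    model = PeftModel.from_pretrained(base, adapter_id, is_trainable=True)
    model.train()  # train mode, as recommended above
    return model, tokenizer


def merge_adapter(model):
    """Fold the LoRA weights into the base weights and return a plain model."""
    return model.merge_and_unload()


# Usage (the adapter id is a hypothetical placeholder):
#   model, tokenizer = load_adapter_for_retuning(
#       "LeroyDyer/Mixtral_AI_MiniTron_II", "your-name/your-lora-adapter")
#   ... train on the previously used dataset with your own trainer ...
#   merged = merge_adapter(model)
#   merged.save_pretrained("merged-model")
```

Retraining on the earlier dataset before calling `merge_adapter` gives the adapter a chance to settle back toward what the model already learned, rather than merging weights that pull it off course.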

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.