Text Generation
Transformers
Safetensors
English
mistral
text-generation-inference
unsloth
trl
sft
Inference Endpoints

::: DEEP MIND PROJECT :::

Here we begin the models for the Deep Mind project:

This model was created from the first trained models in the series: DeepMind! Nice! It is a little bit formal, and it was still stuck on writing a transformer from scratch or writing a training loop (it could still do other good stuff). These models contain:

Thoughts and processes:

SelfRAG:

Agent generation:

Chain of thought:

Deep thinking and memory recall:

Standard instruct version!

  • Fine-tuned from model: LeroyDyer/Mixtral_AI_CyberTron_DeepMind_II

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.
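Since the card tags the model for Transformers text generation, a minimal loading sketch might look like the following. It is not taken from the author: the sampling values, the helper names `generation_settings` and `generate`, and the use of a plain-text prompt (rather than any specific chat template) are all assumptions, and `device_map="auto"` assumes the `accelerate` package is installed.

```python
# Repository name of this model on the Hugging Face Hub.
MODEL_ID = "LeroyDyer/Mixtral_AI_CyberTron_DeepMind_III"


def generation_settings(max_new_tokens=256, temperature=0.7):
    """Illustrative sampling settings (assumed values, not the author's)."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,
        "temperature": temperature,
    }


def generate(prompt, **overrides):
    """Load the model in FP16 and return only the newly generated text."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # matches the FP16 tensor type of the weights
        device_map="auto",          # assumption: requires `accelerate`
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, **generation_settings(**overrides))
    # Drop the prompt tokens so only the continuation is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the full FP16 weights on first use):
# print(generate("Explain what a training loop does.", max_new_tokens=128))
```

Keeping the imports inside `generate` means the settings helper can be inspected or adjusted without loading the 7.24B-parameter weights.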

Model size: 7.24B params
Tensor type: FP16
