metadata
base_model: SicariusSicariiStuff/Impish_LLAMA_3B
datasets:
- KingNish/reasoning-base-20k
language:
- en
license: llama3.2
tags:
- text-generation-inference
- transformers
- llama
- trl
- sft
- reasoning
- llama-3
Model Description
The LoRA adapters that pertain to 25% reasoned variant of Thea RP 3B.
You can merge them to your own Llama 3.2 3B, but why?
Go to the model page to find out what Thea is.