metadata
base_model: lunahr/thea-v2-3b-50r
datasets:
- KingNish/reasoning-base-20k
language:
- en
license: llama3.2
library_name: peft
tags:
- text-generation-inference
- transformers
- llama
- trl
- sft
- reasoning
- llama-3
Model Description
These are the LoRA adapters for the half-reasoned variant of Thea 3B v2.
You can merge them into your own copy of Hermes 3 Llama 3.2 3B, but why?
Go to the model page to find out what Thea is.
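If you do want to merge the adapters yourself, the following is a minimal sketch using the standard `peft` and `transformers` APIs, not an official recipe from this card. The base repo ID `NousResearch/Hermes-3-Llama-3.2-3B` is an assumption about which Hermes 3 checkpoint is meant, and `ADAPTER_ID` is a hypothetical placeholder that you would replace with this repository's ID or a local path to the adapter files.

```python
# Assumption: this is the Hermes 3 Llama 3.2 3B checkpoint the card refers to.
BASE_ID = "NousResearch/Hermes-3-Llama-3.2-3B"
# Hypothetical placeholder: replace with this adapter repo's ID or a local path.
ADAPTER_ID = "path/to/thea-v2-lora-adapters"


def merge_adapters(base_id: str, adapter_id: str, output_dir: str) -> None:
    """Load the base model, apply the LoRA adapters, and save merged weights."""
    # Imported lazily so the module can be loaded without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model the adapters are applied to.
    base = AutoModelForCausalLM.from_pretrained(base_id)

    # Attach the LoRA adapters on top of the base weights.
    model = PeftModel.from_pretrained(base, adapter_id)

    # Fold the LoRA deltas into the base weights and drop the PEFT wrappers.
    merged = model.merge_and_unload()

    # Save the merged model and the base tokenizer side by side.
    merged.save_pretrained(output_dir)
    AutoTokenizer.from_pretrained(base_id).save_pretrained(output_dir)


# Example call (downloads several GB of weights):
# merge_adapters(BASE_ID, ADAPTER_ID, "thea-v2-merged")
```

The merged directory can then be loaded with `AutoModelForCausalLM.from_pretrained` like any regular checkpoint, without `peft` installed.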