lunahr
/

thea-rp-3b-25r-adapter

Piotr Zalewski

Rewritten README

565c888 verified 3 months ago


	---
	base_model: SicariusSicariiStuff/Impish_LLAMA_3B
	datasets:
	- KingNish/reasoning-base-20k
	language:
	- en
	license: llama3.2
	tags:
	- text-generation-inference
	- transformers
	- llama
	- trl
	- sft
	- reasoning
	- llama-3
	---

	# Model Description
	The LoRA adapters for the 25% reasoned variant of Thea RP 3B.

	You can merge them into your own Llama 3.2 3B, but why?
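If you do want to merge, here is a minimal sketch using `transformers` and `peft`. It assumes the adapter is stored in standard PEFT format, that the adapter repo ID is `lunahr/thea-rp-3b-25r-adapter` (taken from this repository's name), and that the base is the `base_model` listed in the metadata above. The function name `merge_adapter` and the output directory are illustrative, not part of this repo.

```python
# Sketch only: repo IDs come from this model card; nothing here is the
# author's own merge script.
BASE_MODEL_ID = "SicariusSicariiStuff/Impish_LLAMA_3B"  # from the card's base_model field
ADAPTER_ID = "lunahr/thea-rp-3b-25r-adapter"            # assumed from the repo name


def merge_adapter(base_id: str, adapter_id: str, out_dir: str) -> None:
    """Load a base model, apply a LoRA adapter, and save the merged weights."""
    # Imported here so the module can be loaded without the heavy dependencies.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model in bfloat16 to keep memory use down.
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)

    # Attach the LoRA weights on top of the base model.
    model = PeftModel.from_pretrained(base, adapter_id)

    # Fold the LoRA deltas into the base weights and drop the adapter wrappers.
    merged = model.merge_and_unload()

    # Save the merged model together with the base tokenizer.
    merged.save_pretrained(out_dir)
    AutoTokenizer.from_pretrained(base_id).save_pretrained(out_dir)
```

Calling `merge_adapter(BASE_MODEL_ID, ADAPTER_ID, "thea-rp-3b-25r-merged")` downloads both repos and writes the merged model to a local folder. Merging onto a different Llama 3.2 3B checkpoint only requires changing `base_id`, though the result is not guaranteed to match the published model.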

	Go to the [model page](https://huggingface.co/piotr25691/thea-rp-3b-25r) to find out what Thea is.