Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lunahr
/
thea-rp-3b-25r
like
1
Text Generation
Transformers
Safetensors
KingNish/reasoning-base-20k
lunahr/thea-name-overrides
English
llama
text-generation-inference
trl
sft
reasoning
llama-3
conversational
Eval Results
Inference Endpoints
License:
llama3.2
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
thea-rp-3b-25r
/
model-00002-of-00002.safetensors
Commit History
Name override with rsLoRA(rank=128, alpha=256)
56aa291
unverified
lunahr
commited on
Oct 17
Upload merged BF16 model
73e5b07
verified
Piotr Zalewski
commited on
Oct 13