PEFT
Safetensors

Model Card for Model ID

This LoRA model is a product of the research paper "LLM QLoRA Fine-Tuning of Llama, DeepSeek, and Qwen: A Skyrim Case Study" (to appear in IEEE Access). The study investigates the interplay between model scale (~8B, ~13B, ~33B), architecture, and data formatting in adapting Large Language Models to knowledge-intensive domains.

By utilizing a multi-stage data cycling strategy and 4-bit Quantized Low-Rank Adaptation (QLoRA), this model was fine-tuned to master the lore of The Elder Scrolls V: Skyrim. The training process involved rigorous hyperparameter optimization, comparing unstructured, structured, and summarized datasets across varying LoRA ranks (16, 32, 64) to determine the optimal configuration for factual recall and narrative fluency. The resulting model demonstrates the capability to generate high-fidelity, hallucination-resistant character biographies and attribute lists, as validated by a comprehensive ensemble LLM-as-a-Judge evaluation framework.

Model Details

Model Description

  • Skyrim Lore - Qwen3-8B
  • Description: Unlike other models in this tier, Qwen3-8B struggled with high-entropy summaries but excelled with explicit structure. Fine-tuned on the Structured dataset with a LoRA Rank of 64, it effectively learned to format lore into strict JSON-like schemas, achieving a Factual Score of 3.0.
  • Best Configuration: Structured Dataset | Rank 64
  • Base Model: unsloth/Qwen3-8B
  • Note: You can also use the pre-quantized version unsloth/Qwen3-8B-bnb-4bit (check repo availability).

Framework versions

  • PEFT 0.15.2
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including MarcosEdu/Skyrim_Qwen3-8B