Qwen 7B Theorem Engine v2 LoRA Adapter

This repository contains the LoRA adapter for a minimalist mathematical theorem correction model fine-tuned from Qwen/Qwen2.5-7B-Instruct.

The adapter is trained to correct flawed, incomplete, or garbled mathematical theorem-style statements while preserving the user's surrounding wording.

Model Contents

This is a PEFT/LoRA adapter, not a full standalone base model. To use it, load it on top of:

Qwen/Qwen2.5-7B-Instruct

Expected files include:

  • adapter_config.json
  • adapter_model.safetensors
  • tokenizer files copied from the training run

Intended Use

The model is intended for theorem-style mathematical statement correction, such as:

Input:  Is it true that every continuous function is differentiable?
Output: Is it true that every differentiable function is continuous?

It is not a proof assistant and should not be treated as an authority for formal verification.

Quick Start

Install the project dependencies from the companion GitHub repository, download the base model, then download this adapter into the expected local path:

python scripts/training/download_7b.py

huggingface-cli download <your-hf-id>/qwen-7b-theorem-engine-v2 \
  --local-dir outputs/qwen_7b_minimalist_engine_v2_trained

python scripts/inference/chat_windows.py

For macOS Apple Silicon, either download the separately converted MLX 4-bit model or merge this adapter into the base model and convert it with mlx_lm.convert.

Training Data

The release training dataset is included in the companion GitHub repository as:

data/student_notes_train_v2.json

The data was externally generated and curated for theorem-correction supervision. No private API key, credential, or private generation endpoint is included in the model files.

Training Configuration

Item Value
Base model Qwen/Qwen2.5-7B-Instruct
Method QLoRA
LoRA rank 16
LoRA alpha 32
LoRA dropout 0.05
Target modules q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
Max sequence length 1024

Limitations

  • Outputs should be checked by a human before use.
  • The model focuses on mathematical statement correction, not broad tutoring or formal proof checking.
  • It may preserve user phrasing even when a more natural standalone theorem statement would be possible.

Base Model

This adapter depends on Qwen/Qwen2.5-7B-Instruct. Please follow the base model's license and usage terms.

Downloads last month
16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dots123/qwen-7b-theorem-engine-v2

Base model

Qwen/Qwen2.5-7B
Adapter
(2098)
this model