mhv_synthetic_qwen3.5_v0.3

Fine-tuned version of kenpath/mhv_mhv-_all_qwen3.5-27b_v0.3 for the Vistaar (unknown) use case.

Model Details

  • Base Model: kenpath/mhv_mhv-_all_qwen3.5-27b_v0.3
  • Fine-tuning Method: LoRA (Low-Rank Adaptation)
  • Training Framework: HuggingFace Transformers + PEFT + DeepSpeed ZeRO-3
  • Merged: Full model weights (LoRA merged into base)

Training Configuration

LoRA Parameters

Parameter Value
Rank (r) 64
Alpha 64
Dropout 0.1
Target Modules ['q_proj', 'k_proj', 'v_proj', 'o_proj', 'gate_proj', 'up_proj', 'down_proj']
Trainable Parameters 318,767,104 / 27,214,765,568 (1.1713%)

Training Hyperparameters

Parameter Value
Epochs 1
Per-device Batch Size 1
Gradient Accumulation 1
Effective Batch Size 1
Learning Rate 5e-06
LR Scheduler cosine
Warmup Ratio N/A
Weight Decay 0.001
Max Grad Norm 1.0

Hardware & Performance

Metric Value
GPUs 8x H100 80GB
Training Duration 5.89 hours
Final Training Loss 0.4769
Dataset Size 2,226 samples
Max Sequence Length 100000

Dataset

  • Name: combined-mh-synthetic-vistaar
  • Samples Used: N/A

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "kenpath/mhv_synthetic_qwen3.5_v0.3",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("kenpath/mhv_synthetic_qwen3.5_v0.3")

messages = [
    {"role": "user", "content": "Your question here"}
]

input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

output = model.generate(input_ids, max_new_tokens=512)
print(tokenizer.decode(output[0], skip_special_tokens=True))

License

This model inherits the license from the base model kenpath/mhv_mhv-_all_qwen3.5-27b_v0.3.


Generated on 2026-03-16 05:21:25 | Merge took 4.8s

Downloads last month
322
Safetensors
Model size
27B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kenpath/mahavistaar-llm-v1

Base model

Qwen/Qwen3.5-27B
Adapter
(1)
this model