lambda/hermes-agent-reasoning-traces
Viewer • Updated • 14.7k • 4.78k • 325
How to use AlihanSDev/kindlehare-qwen7b-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
model = PeftModel.from_pretrained(base_model, "AlihanSDev/kindlehare-qwen7b-lora")QLoRA fine-tune of Qwen/Qwen2.5-7B-Instruct on agentic reasoning traces.
Dataset: lambda/hermes-agent-reasoning-traces (kimi split)
| Parameter | Value |
|---|---|
| Hardware | 1x NVIDIA T4 (Kaggle) |
| Epochs | 1 |
| Learning rate | 1e-4 |
| Batch size | 1 (grad acc 8) |
| LoRA rank | 16 |
| LoRA alpha | 16 |
| Target modules | q/k/v/o/gate/up/down proj |
| Precision | fp16 |
| Optimizer | paged_adamw_8bit |
| Max sequence length | 1024 |
| Training time | ~8 hours |
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch
base_model = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-7B-Instruct",
torch_dtype=torch.float16,
device_map="auto"
)
model = PeftModel.from_pretrained(base_model, "AlihanSDev/kindlehare-qwen7b-lora")
tokenizer = AutoTokenizer.from_pretrained("AlihanSDev/kindlehare-qwen7b-lora")
messages = [{"role": "user", "content": "Write a Python function to reverse a string."}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))