How to use top-papers/Qwen3-VL-8B-Instruct-scireason with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-VL-8B-Instruct")
model = PeftModel.from_pretrained(base_model, "top-papers/Qwen3-VL-8B-Instruct-scireason")

This repository contains the fine-tuned SciReason VLM artifacts produced by the DataSphere SFT + GRPO pipeline.

- Repository root: final GRPO adapter files copied from outputs/hf_top_papers_qwen3vl_8b_grpo_lora for convenient loading.
- artifacts/sft_lora/: SFT LoRA adapter directory copied from outputs/hf_top_papers_qwen3vl_8b_sft_lora.
- artifacts/grpo_lora/: complete final GRPO output directory copied from outputs/hf_top_papers_qwen3vl_8b_grpo_lora.
- artifacts/archives/: compressed .tar.gz archives produced by the job.
- artifacts/data/: generated train/eval JSONL files and dataset summary.
- artifacts/reports/: budget, final summary, upload manifest and runtime reports.

- Base model: Qwen/Qwen3-VL-8B-Instruct
- Dataset: top-papers/top-papers-graph-experts-data
- Run: hf_top_papers_qwen3vl_8b
- Timestamp: 2026-06-22T07:38:23Z

The root of this repository is prepared as the final GRPO adapter directory. For
LoRA/PEFT loading, use the same base model listed above and load this repository
as the adapter. The complete SFT and GRPO directories are also preserved under
artifacts/ for auditability and reproducibility.
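
Because the SFT and GRPO adapters are stored in separate directories, it can be useful to choose the training stage at load time. The following is a minimal sketch, not part of the original pipeline: the helper names `adapter_kwargs` and `load_stage_adapter` and the stage labels are hypothetical, and it assumes that `artifacts/sft_lora/` and `artifacts/grpo_lora/` each contain a loadable PEFT adapter (an `adapter_config.json` plus weights) directly at their top level. It relies on the `subfolder` keyword that `PeftModel.from_pretrained` forwards when resolving adapter files.

```python
REPO_ID = "top-papers/Qwen3-VL-8B-Instruct-scireason"

# Hypothetical stage labels mapped to directories named in this card.
# None means the repository root, which holds the final GRPO adapter.
ADAPTER_SUBFOLDERS = {
    "grpo": None,
    "grpo_full": "artifacts/grpo_lora",
    "sft": "artifacts/sft_lora",
}


def adapter_kwargs(stage):
    """Return the extra keyword arguments needed to load the given stage."""
    if stage not in ADAPTER_SUBFOLDERS:
        raise ValueError(
            f"unknown stage {stage!r}; expected one of {sorted(ADAPTER_SUBFOLDERS)}"
        )
    subfolder = ADAPTER_SUBFOLDERS[stage]
    # The root adapter needs no subfolder argument at all.
    return {} if subfolder is None else {"subfolder": subfolder}


def load_stage_adapter(base_model, stage="grpo"):
    """Attach the adapter for `stage` to an already loaded base model.

    Assumption: the chosen directory holds adapter files at its top level.
    """
    # Imported lazily so the path helpers above work without peft installed.
    from peft import PeftModel

    return PeftModel.from_pretrained(base_model, REPO_ID, **adapter_kwargs(stage))
```

With `base_model` loaded as in the snippet at the top of this card, `load_stage_adapter(base_model, "sft")` would attach the SFT adapter, while the default stage loads the final GRPO adapter from the repository root.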
Base model
Qwen/Qwen3-VL-8B-Instruct