AVA logo

AVA v2 (merged weights)

Standalone bf16 weights of AVA v2 — the QLoRA adapter pre-merged into Qwen/Qwen3.5-2B. Load directly with transformers, no PEFT required:

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "NAME0x0/AVA-v2-merged", device_map="auto", dtype="bfloat16"
)
tokenizer = AutoTokenizer.from_pretrained("NAME0x0/AVA-v2-merged")

Benchmarks, training details, limitations: see the adapter card — 82.0% ARC-Challenge, 92.0% ARC-Easy, 59.2% MMLU at 2B params, trained on a single 4 GB laptop GPU.
No Python / CPU-only: use the GGUF builds (Ollama, llama.cpp, LM Studio).
Reproduce everything: github.com/NAME0x0/AVA.

Downloads last month: 10

Safetensors

Model size

2B params

Tensor type

BF16

Model tree for NAME0x0/AVA-v2-merged

Base model

Qwen/Qwen3.5-2B-Base

Finetuned

Qwen/Qwen3.5-2B

Finetuned

(196)

this model