AVA logo

AVA v2 (merged weights)

Standalone bf16 weights of AVA v2 — the QLoRA adapter pre-merged into Qwen/Qwen3.5-2B. Load directly with transformers, no PEFT required:

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "NAME0x0/AVA-v2-merged", device_map="auto", dtype="bfloat16"
)
tokenizer = AutoTokenizer.from_pretrained("NAME0x0/AVA-v2-merged")
  • Benchmarks, training details, limitations: see the adapter card — 82.0% ARC-Challenge, 92.0% ARC-Easy, 59.2% MMLU at 2B params, trained on a single 4 GB laptop GPU.
  • No Python / CPU-only: use the GGUF builds (Ollama, llama.cpp, LM Studio).
  • Reproduce everything: github.com/NAME0x0/AVA.
Downloads last month
10
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for NAME0x0/AVA-v2-merged

Finetuned
Qwen/Qwen3.5-2B
Finetuned
(196)
this model