๐Ÿ“Š StatBot โ€” Qwen3-VL-2B Statistics Tutor for High School Students

A safe, guardrailed AI tutor specifically designed for teaching statistics and probability to high school students (ages 14โ€“18). Built for the Oklahoma Department of Education project. Runs offline on phones via PocketPal.


๐ŸŽฏ What This Model Does

StatBot is a fine-tuned vision-language model that:

  • ๐Ÿ“Š Teaches statistics โ€” mean, median, distributions, hypothesis testing, probability, regression, and more
  • ๐Ÿ–ผ๏ธ Reads images โ€” graphs, charts, histograms, textbook pages, handwritten formulas
  • ๐Ÿ›ก๏ธ Refuses unsafe content โ€” off-topic questions, jailbreaks, inappropriate requests
  • ๐Ÿ’™ Supports students in distress โ€” empathy + counselor referral
  • ๐ŸŽ“ Uses hint-first teaching โ€” guides students rather than just giving answers

๐Ÿ“ Files in This Repo

File Size What It Is
Qwen3VL-2B-StatBot-Q8_0.gguf 1.71 GB Main model (Q8_0 quantized)
Qwen3VL-2B-StatBot-mmproj.gguf 782 MB Vision projector (needed for images)

โš ๏ธ Both files required for full image + text capability!


๐Ÿ“ฑ How to Use on Your Phone (PocketPal)

  1. Install PocketPal from your app store
  2. Open PocketPal โ†’ Models โ†’ Search HuggingFace
  3. Search: Anudeeprao1/Qwen3VL-2B-StatBot_Instruct
  4. Download BOTH files:
    • โœ… Qwen3VL-2B-StatBot-Q8_0.gguf
    • โœ… Qwen3VL-2B-StatBot-mmproj.gguf
  5. Tap Load Model
  6. Start asking statistics questions or upload a graph!

๐Ÿ’ป How to Use with Ollama

ollama run hf.co/Anudeeprao1/Qwen3VL-2B-StatBot_Instruct

๐Ÿ’ป How to Use with llama.cpp

llama-server -hf Anudeeprao1/Qwen3VL-2B-StatBot_Instruct

Then open http://localhost:8080 in browser.


๐Ÿ’ก Example Questions

Text Only

  • "What is the difference between mean and median?"
  • "Explain the central limit theorem to me"
  • "How do I calculate standard deviation step by step?"
  • "What is a p-value?"

With Images

  • Upload a histogram โ†’ "What type of distribution is this?"
  • Upload a textbook page โ†’ "Explain this formula"
  • Upload a scatter plot โ†’ "What relationship does this show?"
  • Upload a bar chart โ†’ "Describe this data"

๐Ÿ›ก๏ธ Safety Features

StatBot is designed with 7 categories of guardrails:

Category Behavior
Off-topic questions Refuses politely, redirects to stats
Inappropriate content Refuses firmly
Identity probing Deflects, never reveals internals
Jailbreak attempts Resists prompt injection
Student distress Shows empathy, refers to counselor
Privacy questions Confirms FERPA + COPPA design
Educational audits Provides compliance info

โš™๏ธ Training Details

Parameter Value
Base model unsloth/Qwen3-VL-2B-Instruct
Fine-tuning method LoRA SFT (Unsloth)
LoRA rank 16
Quantization Q8_0 GGUF
Training samples 7,202
Guardrail samples 720 (7 categories ร— 3 repetitions)
Epochs 3
Learning rate 2e-4 (cosine schedule)
Effective batch size 16
Max sequence length 2048
Hardware NVIDIA RTX 4500 Ada (24 GB)
Framework Unsloth + TRL + Transformers

๐Ÿ“Š Performance

  • Final training loss: ~0.09
  • Final validation loss: ~0.09
  • Perplexity: ~1.1
  • No overfitting detected (train โ‰ˆ val loss)
  • Vision capability: Full multimodal support via mmproj

๐ŸŽ“ Intended Audience

  • โœ… High school statistics students (ages 14โ€“18)
  • โœ… Oklahoma high school teachers (classroom aid)
  • โœ… Statistics tutors needing safe AI assistance
  • โœ… Educational researchers studying AI guardrails
  • โŒ NOT for general-purpose chat (refuses non-stats)
  • โŒ NOT for elementary/middle school content

๐Ÿ›๏ธ Compliance & Safety

  • FERPA compliant design (no student data storage)
  • COPPA compliant for under-13 protections
  • Oklahoma DoE project deliverable
  • No cross-session memory (every chat is private)
  • Crisis-aware (refers to 988 Lifeline + adults)

๐Ÿ“š Try a Sample Conversation

User: "What is standard deviation?"

StatBot: "Great question! ๐ŸŽฏ Standard deviation tells us how spread out data is from the average...

Think of test scores: if everyone scored close to the average, SD is small. If scores are all over the place, SD is large.

Formula: ฯƒ = โˆš(ฮฃ(x - ฮผ)ยฒ / N)

Want me to walk through an example?"


User: "What is the capital of France?"

StatBot: "I'm StatBot, your statistics tutor designed for educational purposes only! I can only help with high school statistics topics. Try asking me about mean, median, probability, or graphs! Have a good day. ๐Ÿ“Š"


โš ๏ธ Limitations

  • Only English language
  • Only statistics/probability topics
  • Best for high school level (ages 14โ€“18)
  • Requires both GGUF files for image input
  • Q8_0 quantization (high quality but ~2 GB)

๐Ÿ“„ License

Apache 2.0 โ€” Free for educational and research use.


๐Ÿ‘ค Author

Anudeep โ€” Oklahoma Department of Education AI Tutoring Project

For questions or collaboration: open a discussion on this repo!


๐Ÿท๏ธ Citation

If you use this model in research, please cite:

@misc{statbot2026,
  author       = {Anudeep},
  title        = {StatBot: A Guardrailed Vision-Language Tutor for High School Statistics},
  year         = {2026},
  publisher    = {HuggingFace},
  url          = {https://huggingface.co/Anudeeprao1/Qwen3VL-2B-StatBot_Instruct}
}

๐Ÿ™ Acknowledgments

  • Unsloth โ€” for the 2x faster fine-tuning framework
  • Qwen team โ€” for the excellent base VL model
  • Oklahoma DoE โ€” for funding this educational AI research
  • PocketPal โ€” for offline mobile inference support
Downloads last month
93
GGUF
Model size
2B params
Architecture
qwen3vl
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Anudeeprao1/Qwen3VL-2B-StatBot_Instruct

Quantized
(2)
this model