Quan Nguyen's picture

Quan Nguyen PRO

qnguyen3

·

qnguyen3

AI & ML interests

None yet

Organizations

Posts 1

Post

2907

🎉 Introducing nanoLLaVA, a powerful multimodal AI model that packs the capabilities of a 1B parameter vision language model into just 5GB of VRAM. 🚀 This makes it an ideal choice for edge devices, bringing cutting-edge visual understanding and generation to your devices like never before. 📱💻

Model: qnguyen3/nanoLLaVA 🔍
Spaces: qnguyen3/nanoLLaVA (thanks to @merve )

Under the hood, nanoLLaVA is based on the powerful vilm/Quyen-SE-v0.1 (my Qwen1.5-0.5B finetune) and Google's impressive google/siglip-so400m-patch14-384. 🧠 The model is trained using a data-centric approach to ensure optimal performance. 📊

In the spirit of transparency and collaboration, all code and model weights are open-sourced under the Apache 2.0 license. 🤝

Collections 2

Papers 1

arxiv:2312.11011

spaces 1

Running on Zero

nanoLLaVA

models 9

qnguyen3/nanoLLaVA

Text Generation • Updated 14 days ago • 2.34k • 116

qnguyen3/quan-1.8b-chat

Text Generation • Updated Feb 21 • 2.54k • 9

qnguyen3/quan-1.8b-base-v2

Text Generation • Updated Jan 31 • 5

qnguyen3/Mixtral-4x400M

Text Generation • Updated Jan 23 • 15 • 2

qnguyen3/quan-1.8b-base

Text Generation • Updated Jan 20 • 2.75k • 3

qnguyen3/quan-1.8b-chat-GGUF

Updated Jan 16 • 45 • 1

qnguyen3/deepseek-vi-qlora-1e

Text Generation • Updated Jan 15 • 4

qnguyen3/vinallama-16b-chat-franken

Text Generation • Updated Jan 4 • 4 • 1

qnguyen3/look-1-v0.1

Updated Oct 23, 2023

datasets 5

qnguyen3/Viet-ORPO-Mix

Updated 5 days ago

qnguyen3/demo_faq

Viewer • Updated Mar 29 • 5

qnguyen3/llava-fn-calling

Viewer • Updated Dec 26, 2023 • 1 • 23

qnguyen3/ocr_vqa

Preview • Updated Oct 23, 2023 • 2

qnguyen3/alapaca-vi

Updated Jun 14, 2023