Hardware requirements

by akudamcs - opened 4 days ago


akudamcs

4 days ago

What hardware do I need to run the model locally?

Sakeador

Owner 4 days ago

Hi! To run AIkuda locally you'll need:

Minimum: 1× GPU with at least 24 GB VRAM (RTX 3090 or RTX 4090 with 24 GB, RTX 5090 with 32 GB) running Q4 quantization (17 GB) via vLLM or Ollama.
Recommended: 1× GPU with 24 GB VRAM running Q6 quantization (22 GB) for better quality.
Optimal: 3× GPUs with 16 GB each (48 GB total) running full BF16 with tensor parallelism.
CPU only: Possible with llama.cpp + GGUF Q4, but requires 64 GB RAM and will be slow.
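
If it helps, here is a rough sketch of how the tiers above map to a given machine. It only encodes the figures from this list; the function name `pick_setup` and its parameters are made up for illustration and are not part of any AIkuda tooling.

```python
def pick_setup(gpu_vram_gb, system_ram_gb=0):
    """Map a machine to one of the tiers listed above.

    gpu_vram_gb: list with the VRAM of each GPU, in GB (hypothetical input format).
    system_ram_gb: system RAM in GB, used only for the CPU-only fallback.
    """
    gpus = sorted(gpu_vram_gb, reverse=True)

    # Optimal tier: three GPUs with at least 16 GB each (48 GB total).
    if len(gpus) >= 3 and gpus[2] >= 16:
        return "BF16 with tensor parallelism across 3 GPUs"

    # Minimum and recommended tiers share the same hardware: one 24 GB GPU.
    # Q6 (22 GB) gives better quality, Q4 (17 GB) leaves more headroom.
    if gpus and gpus[0] >= 24:
        return "Q6 on one GPU (or Q4 for more headroom)"

    # CPU-only fallback: llama.cpp with GGUF Q4 and 64 GB of RAM.
    if system_ram_gb >= 64:
        return "CPU only: llama.cpp with GGUF Q4 (slow)"

    return "below the stated minimum"


print(pick_setup([24]))               # single RTX 3090 / RTX 4090 class card
print(pick_setup([16, 16, 16]))       # three 16 GB cards
print(pick_setup([8], system_ram_gb=64))
```

The thresholds are the numbers quoted above and leave no margin for context length or KV cache, so treat the result as a starting point rather than a guarantee.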

We'll publish GGUF quantized versions alongside the main model for easier local deployment.
Stay tuned! 🛡️
