Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
hexgrid-cloud 's Collections
Best Open-Source Coding LLMs for Private Deployment
Production-Ready Quantized Chat LLMs — 4-bit & 8-bit
Open Source RAG Stack — Embed + Rerank + Generate
One-click LLM deployments on Private GPU

Open Source RAG Stack — Embed + Rerank + Generate

updated 16 days ago

The complete open-source RAG pipeline. Best of the embedding models, one reranker, one chat model. All deployable on dedicated GPUs at hexgrid.cloud

Upvote
-

  • Qwen/Qwen3.5-9B

    Image-Text-to-Text • 10B • Updated Mar 2 • 9.83M • • 1.6k

  • Qwen/Qwen3.5-27B

    Image-Text-to-Text • 28B • Updated Apr 24 • 2.5M • • 988

  • Qwen/Qwen3-Embedding-8B

    Feature Extraction • 8B • Updated Jul 7, 2025 • 2.4M • • 713

  • mixedbread-ai/mxbai-rerank-large-v2

    Text Ranking • 2B • Updated Apr 8 • 54.1k • 141

  • Qwen/Qwen3-Reranker-4B

    Text Ranking • 4B • Updated Apr 16 • 1.63M • 143

  • Qwen/Qwen3-Embedding-4B

    Feature Extraction • 4B • Updated Jun 20, 2025 • 2.24M • 288
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs