add-quantized-gguf-files
#1
by espenhk - opened
No description provided.
Same as with the regular Mistral and Llama models, this PR adds quantized GGUF versions of the model so that you can run it on your own machine (for instance using Ollama). No particular config choices were made, other than using f16 vectors (matching the previous releases, as well as the safetensors files).
Ready on my end!
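For anyone who would rather load the quantized files directly instead of going through Ollama, here is a minimal sketch using llama-cpp-python. The filename and the generation parameters are placeholders, not part of this PR; substitute whichever quantized GGUF file you download from this repo.

```python
# Minimal sketch: running a quantized GGUF locally with llama-cpp-python.
# The model_path below is a placeholder -- replace it with the quant you
# downloaded from this repo (e.g. a Q4_K_M or Q8_0 file).
from llama_cpp import Llama

llm = Llama(
    model_path="model.Q4_K_M.gguf",  # placeholder path to the downloaded GGUF
    n_ctx=4096,                      # context window; adjust to your hardware
    n_gpu_layers=-1,                 # offload all layers to GPU if one is available
)

# Simple chat-style completion; the prompt is illustrative only.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Skriv en kort hilsen på norsk."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```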
espenhk changed pull request status to open
NorLLM-NTNU changed pull request status to merged