Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
TheBloke
/
sonya-medium-x8-MoE-GGUF
like
4
Transformers
GGUF
mixtral
text-generation-inference
License:
wtfpl
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
c23f40a
sonya-medium-x8-MoE-GGUF
1 contributor
History:
11 commits
TheBloke
Upload README.md
c23f40a
9 months ago
.gitattributes
2.22 kB
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit c75ca5d)
9 months ago
README.md
21.3 kB
Upload README.md
9 months ago
config.json
31 Bytes
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q2_K.gguf
23.4 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q3_K_M.gguf
30.5 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q4_0.gguf
39.6 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q4_K_M.gguf
39.6 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q5_0.gguf
48.2 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q5_K_M.gguf
48.2 GB
LFS
GGUF model commit (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q6_K.gguf-split-a
28.7 GB
LFS
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q6_K.gguf-split-b
28.7 GB
LFS
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q8_0.gguf-split-a
37.1 GB
LFS
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit c75ca5d)
9 months ago
sonya-medium-x8-moe.Q8_0.gguf-split-b
37.1 GB
LFS
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit c75ca5d)
9 months ago