TheBloke/Lemur-70B-Chat-v1-GGUF
Tags: Text Generation, Transformers, GGUF, English, llama, code, text-generation-inference
License: cc-by-nc-4.0
Lemur-70B-Chat-v1-GGUF (1 contributor, history: 20 commits)
Commit e9a1d78 by TheBloke, about 1 year ago: GGUF model commit (made with llama.cpp commit 178b185)
File                                 Size       LFS  Last commit message                                     Updated
.gitattributes                       2.33 kB    -    Add q6_K and q8_0 as split files due to 50GB limit      about 1 year ago
LICENSE.txt                          7.02 kB    -    Add Llama 2 license files                               about 1 year ago
Notice                               112 Bytes  -    Add Llama 2 license files                               about 1 year ago
README.md                            16.4 kB    -    Update README.md                                        about 1 year ago
USE_POLICY.md                        4.77 kB    -    Add Llama 2 license files                               about 1 year ago
config.json                          29 Bytes   -    Initial GGUF model commit                               about 1 year ago
lemur-70b-chat-v1.Q2_K.gguf          29.3 GB    LFS  GGUF model commit (made with llama.cpp commit 178b185)  about 1 year ago
lemur-70b-chat-v1.Q3_K_L.gguf        36.1 GB    LFS  GGUF model commit (made with llama.cpp commit 178b185)  about 1 year ago
lemur-70b-chat-v1.Q3_K_M.gguf        33.2 GB    LFS  GGUF model commit (made with llama.cpp commit 178b185)  about 1 year ago
lemur-70b-chat-v1.Q3_K_S.gguf        29.9 GB    LFS  GGUF model commit (made with llama.cpp commit 178b185)  about 1 year ago
lemur-70b-chat-v1.Q4_K_M.gguf        41.4 GB    LFS  Initial GGUF model commit                               about 1 year ago
lemur-70b-chat-v1.Q4_K_S.gguf        39.1 GB    LFS  Initial GGUF model commit                               about 1 year ago
lemur-70b-chat-v1.Q5_K_M.gguf        48.8 GB    LFS  Initial GGUF model commit                               about 1 year ago
lemur-70b-chat-v1.Q5_K_S.gguf        47.5 GB    LFS  Initial GGUF model commit                               about 1 year ago
lemur-70b-chat-v1.Q6_K.gguf-split-a  36.7 GB    LFS  Add q6_K and q8_0 as split files due to 50GB limit      about 1 year ago
lemur-70b-chat-v1.Q6_K.gguf-split-b  19.9 GB    LFS  Add q6_K and q8_0 as split files due to 50GB limit      about 1 year ago
lemur-70b-chat-v1.Q8_0.gguf-split-a  36.7 GB    LFS  Add q6_K and q8_0 as split files due to 50GB limit      about 1 year ago
lemur-70b-chat-v1.Q8_0.gguf-split-b  36.6 GB    LFS  Add q6_K and q8_0 as split files due to 50GB limit      about 1 year ago
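
The Q6_K and Q8_0 quantizations are stored as two parts each (-split-a and -split-b) because of the 50GB limit noted in the commit messages. A minimal sketch of how the parts might be rejoined after download is shown below. It assumes the parts are plain byte-wise pieces of a single GGUF file that only need to be concatenated in a, b order; the listing itself does not confirm this, so check README.md first. The function name join_split_gguf and the output file name are hypothetical.

```python
from pathlib import Path
import shutil


def join_split_gguf(parts, output):
    """Concatenate the given part files, in order, into one output file.

    Assumption: the parts are raw byte-wise splits of one file, so simple
    concatenation restores the original. Returns the output size in bytes.
    """
    output = Path(output)
    with output.open("wb") as dst:
        for part in parts:
            with Path(part).open("rb") as src:
                # Stream in 16 MiB chunks so multi-GB parts never sit fully in memory.
                shutil.copyfileobj(src, dst, length=16 * 1024 * 1024)
    return output.stat().st_size


# Example call (file names taken from the listing; output name is an assumption):
# join_split_gguf(
#     ["lemur-70b-chat-v1.Q6_K.gguf-split-a", "lemur-70b-chat-v1.Q6_K.gguf-split-b"],
#     "lemur-70b-chat-v1.Q6_K.gguf",
# )
```

The joined file needs roughly as much free disk space as both parts combined (about 56.6 GB for Q6_K and 73.3 GB for Q8_0, going by the sizes listed), and the parts can be deleted once the result has been verified.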