Commit 6d432c7 by TheBloke
1 parent: aefcd0d

Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1)
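
The commit message says the file was uploaded in parts to stay under the 50GB limit, but not how the parts were produced. The sketch below assumes they are plain byte-wise slices meant to be concatenated in order (`-split-a`, then `-split-b`). The helper name `join_parts` and the chunk size are illustrative, not part of the repository.

```python
import shutil
from pathlib import Path


def join_parts(parts, output, chunk_size=16 * 1024 * 1024):
    """Concatenate `parts`, in the order given, into `output`.

    Assumes each part is a raw byte slice of the original file.
    Streams in chunks so multi-gigabyte parts never sit fully in memory.
    Returns the total number of bytes taken from the parts.
    """
    total = 0
    with open(output, "wb") as dst:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, dst, chunk_size)
            total += Path(part).stat().st_size
    return total


# Hypothetical usage, once both parts are downloaded to the working directory:
# join_parts(
#     ["yarn-llama-2-70b-32k.Q8_0.gguf-split-a",
#      "yarn-llama-2-70b-32k.Q8_0.gguf-split-b"],
#     "yarn-llama-2-70b-32k.Q8_0.gguf",
# )
```

Under that assumption, the joined Q8_0 file should be 36646165098 + 36646165078 = 73292330176 bytes, which gives a quick check that neither part was truncated.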

.gitattributes CHANGED
@@ -45,3 +45,5 @@ yarn-llama-2-70b-32k.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 yarn-llama-2-70b-32k.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 yarn-llama-2-70b-32k.Q6_K.gguf-split-a filter=lfs diff=lfs merge=lfs -text
 yarn-llama-2-70b-32k.Q6_K.gguf-split-b filter=lfs diff=lfs merge=lfs -text
+yarn-llama-2-70b-32k.Q8_0.gguf-split-a filter=lfs diff=lfs merge=lfs -text
+yarn-llama-2-70b-32k.Q8_0.gguf-split-b filter=lfs diff=lfs merge=lfs -text
yarn-llama-2-70b-32k.Q8_0.gguf-split-a ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4e8138c7a72f79a4826dd5b009b8eddc2b9fee13bb2100c10a3731eaf66b79d7
+size 36646165098
yarn-llama-2-70b-32k.Q8_0.gguf-split-b ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:87278a964e56b8e0ab764c1ab41ddf998fc4d40ca72fb27e5c0ec65cf2c59dd5
+size 36646165078
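
Each added file above is a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 of the real content, and its size in bytes. As a rough sketch, a downloaded part could be checked against its pointer as follows. The function names `parse_lfs_pointer` and `verify_file` are hypothetical helpers, not part of Git LFS or any Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into {'version', 'oid', 'size'}.

    'oid' is returned as the bare hex digest; 'size' as an int.
    Only sha256 oids are handled in this sketch.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return {"version": fields["version"], "oid": digest, "size": int(fields["size"])}


def verify_file(path, pointer, chunk_size=16 * 1024 * 1024):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    # Cheap size check first, so a truncated download fails before hashing.
    if Path(path).stat().st_size != pointer["size"]:
        return False
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["oid"]
```

Comparing the size before hashing is a deliberate choice here: hashing tens of gigabytes is slow, and an incomplete download is usually the more likely failure.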