kalle07 committed on
Commit a2a30ce · verified
1 Parent(s): f6f45b1

Create README.md


Here are a few embedding models that I have collected from Hugging Face.
All file names are the same as, or very similar to, the originals, in case you want to search for them.

I tested all models locally with ALLM (AnythingLLM) with LM as server
(they work, but each one needs different settings).

My short impression is that these are okay:
- mug-b-1.6-q8_0.gguf
- mxbai-embed-large-v1.Q8_0.gguf
- nomic_f16.gguf / nomic-embed-text-v1.5.Q8_0.gguf

Testing the other models is up to you...


Short hint: set the context length (Max Tokens) of your main model to 16k, set the embedding chunk length (Max Embedding Chunk Length) to 1024, and set Max Context Snippets to 14.
That way you can receive 14 snippets of 1024 tokens each (14,336 tokens, roughly 11,000 words) from your document and still have 2048 tokens (~1500 words) left for the answer.
You can adjust these values to fit your needs and VRAM usage.
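
The arithmetic behind the hint above can be checked with a small sketch. The helper name `context_budget` and the ratio of 0.75 words per token are assumptions for illustration only (the real ratio depends on tokenizer and language), and the chunk length is treated as tokens, as in the hint. It is not part of AnythingLLM.

```python
# Hypothetical helper, not an AnythingLLM API.
# Assumption: ~0.75 English words per token (rough rule of thumb).
WORDS_PER_TOKEN = 0.75


def context_budget(context_length, chunk_length, max_snippets):
    """Split a context window into retrieved-snippet tokens and what is left."""
    retrieved = chunk_length * max_snippets
    remaining = context_length - retrieved
    if remaining < 0:
        raise ValueError("snippets alone exceed the context length")
    return {
        "retrieved_tokens": retrieved,
        "remaining_tokens": remaining,
        "retrieved_words": round(retrieved * WORDS_PER_TOKEN),
        "remaining_words": round(remaining * WORDS_PER_TOKEN),
    }


# Settings from the hint: 16k context, 1024-token chunks, 14 snippets.
budget = context_budget(16 * 1024, 1024, 14)
print(budget)
```

Keep in mind that the system prompt and your question also count against the context window, so the space actually left for the answer is likely a bit smaller than the remaining value shown here.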



(All licenses and terms of use belong to the original authors)

Files changed (1)
  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
+ ---
+ library_name: sentence-transformers
+ pipeline_tag: sentence-similarity
+ tags:
+ - sentence-transformers
+ - sentence-similarity
+ - feature-extraction
+ - embedder
+ - embedding
+ - models
+ - GGUF
+
+ ---