Some models cannot handle large TXT files (maybe only ~200 pages; hints below).
<br>
<b>My short impression:</b>
<ul style="line-height: 1;">
<li>nomic-embed-text</li>
<li>mxbai-embed-large</li>
<li>mug-b-1.6</li>
<li>Ger-RAG-BGE-M3 (German)</li>
</ul>
These work well; all the others are up to you!
<br>
In ALLM everything is cut into 1024-character parts, so count on roughly two times or a bit more tokens than in this calculation.

You can retrieve 14 snippets of 1024 t each (14336 t) from your document (~10000 words), leaving ~1600 t for the answer (~1000 words, about 2 pages).

You can play with the settings to suit your needs, e.g. 8 snippets of 2048 t, or 28 snippets of 512 t ...
<ul style="line-height: 1;">
<li>8000 t (~6000 words): ~0.8 GB VRAM usage</li>
<li>16000 t (~12000 words): ~1.5 GB VRAM usage</li>
<li>32000 t (~24000 words): ~3 GB VRAM usage</li>
</ul>
<br>
...
on Discord (sevenof9)

...

<ul style="line-height: 1;">
<li>avemio/German-RAG-BGE-M3-MERGED-x-SNOWFLAKE-ARCTIC-HESSIAN-AI (German, English) - 600 pages and more</li>
<li>maidalun1020/bce-embedding-base_v1 (English and Chinese) - only ~200 pages</li>
<li>maidalun1020/bce-reranker-base_v1 (English, Chinese, Japanese and Korean) - only ~200 pages</li>
</ul>
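The VRAM figures listed earlier (8000 t → ~0.8 GB, 16000 t → ~1.5 GB, 32000 t → ~3 GB) suggest a roughly linear cost of about 0.09-0.1 GB per 1000 tokens. A throwaway estimator fitted by eye to those three points (the function name and constant are my own, not from ALLM):

```python
# Rough linear VRAM estimate derived from the data points quoted above.
# 0.095 GB per 1000 tokens is an eyeball fit, not a measured constant.

def estimate_vram_gb(tokens: int, gb_per_1000_tokens: float = 0.095) -> float:
    """Very rough VRAM estimate for a given token budget, in GB."""
    return round(tokens / 1000 * gb_per_1000_tokens, 2)

for t in (8000, 16000, 32000):
    print(t, estimate_vram_gb(t))  # 0.76 / 1.52 / 3.04 GB, close to the quoted values
```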