---
base_model: RinaChen/Guwen-nomic-embed-text-v1.5
datasets: []
language: []
library_name: sentence-transformers
pipeline_tag: sentence-similarity
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:756057
- loss:MultipleNegativesRankingLoss
- llama-cpp
- gguf-my-repo
widget:
- source_sentence: 府君奈何以蓋世之才欲立忠於垂亡之國
  sentences:
  - 將遠方進貢來的奇獸飛禽以及白山雞等物縱還山林比起雍畤的祭祀禮數頗有增加
  - 您為什麼以蓋絕當世的奇才卻打算向這個面臨滅亡的國家盡效忠心呢
  - 大統年間他出任岐州刺史在任不久就因為能力強而聞名
- source_sentence: 將率既至授單于印紱詔令上故印紱
  sentences:
  - 已經到達的五威將到達後授給單于新印信宣讀詔書要求交回漢朝舊印信
  - 於是拜陶隗為西南面招討使
  - 司馬錯建議秦惠王攻打蜀國張儀說 還不如進攻韓國
- source_sentence: 行醮禮皇太子詣醴席樂作
  sentences:
  - 閏七月十七日上宣宗廢除皇后胡氏尊諡
  - 等到看見西羌鼠竊狗盜父不父子不子君臣沒有分別四夷之人西羌最為低下
  - 行醮禮皇太子來到酒醴席奏樂
- source_sentence: 領軍臧盾太府卿沈僧果等並被時遇孝綽尤輕之
  sentences:
  - 過了幾天太宰官又來要國書並且說 我國自太宰府以東上國使臣沒有到過今大朝派使臣來若不見國書何以相信
  - 所以丹陽葛洪解釋說渾天儀注說 天體像雞蛋地就像是雞蛋中的蛋黃獨處於天體之內天是大的而地是小的
  - 領軍臧盾太府卿沈僧果等都是因趕上時機而得到官職的孝綽尤其輕蔑他們每次在朝中集合會面雖然一起做官但從不與他們說話
- source_sentence: 九月辛未太祖曾孫舒國公從式進封安定郡王
  sentences:
  - 九月初二太祖曾孫舒國公從式進封安定郡王
  - 楊難當在漢中大肆燒殺搶劫然後率眾離開了漢中向西返回仇池留下趙溫據守梁州又派他的魏興太守薛健屯駐黃金山
  - 正統元年普定蠻夷阿遲等反叛非法稱王四處出擊攻打掠奪
---

# RinaChen/Guwen-nomic-embed-text-v1.5-Q4_K_M-GGUF

This model was converted to GGUF format from [`RinaChen/Guwen-nomic-embed-text-v1.5`](https://huggingface.co/RinaChen/Guwen-nomic-embed-text-v1.5) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
Refer to the [original model card](https://huggingface.co/RinaChen/Guwen-nomic-embed-text-v1.5) for more details on the model.

## Use with llama.cpp

Install llama.cpp through brew (works on Mac and Linux):

```bash
brew install llama.cpp
```

Invoke the llama.cpp server or the CLI.

### CLI:

```bash
llama-cli --hf-repo RinaChen/Guwen-nomic-embed-text-v1.5-Q4_K_M-GGUF --hf-file guwen-nomic-embed-text-v1.5-q4_k_m.gguf -p "The meaning to life and the universe is"
```

### Server:

```bash
llama-server --hf-repo RinaChen/Guwen-nomic-embed-text-v1.5-Q4_K_M-GGUF --hf-file guwen-nomic-embed-text-v1.5-q4_k_m.gguf -c 2048
```

Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.

Step 1: Clone llama.cpp from GitHub.

```
git clone https://github.com/ggerganov/llama.cpp
```

Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag, along with any other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).

```
cd llama.cpp && LLAMA_CURL=1 make
```

Step 3: Run inference through the main binary.

```
./llama-cli --hf-repo RinaChen/Guwen-nomic-embed-text-v1.5-Q4_K_M-GGUF --hf-file guwen-nomic-embed-text-v1.5-q4_k_m.gguf -p "The meaning to life and the universe is"
```

or

```
./llama-server --hf-repo RinaChen/Guwen-nomic-embed-text-v1.5-Q4_K_M-GGUF --hf-file guwen-nomic-embed-text-v1.5-q4_k_m.gguf -c 2048
```
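
The front matter tags this checkpoint as a sentence-similarity model, so its useful output is one embedding vector per sentence rather than generated text. As a minimal sketch of how two such vectors could be compared once you have them, the script below computes cosine similarity with `awk`. The function name `cosine_similarity`, the comma-separated input format, and the three-number vectors are assumptions made for illustration. They are not real model output, and cosine similarity is a common choice for this kind of model, not something this card specifies.

```bash
#!/usr/bin/env bash
# Sketch: cosine similarity between two embedding vectors.
# Assumption: each vector is passed as a comma-separated list of numbers.
# The vectors below are made-up placeholders, not output from this model;
# real embeddings are much longer lists of floats.

cosine_similarity() {
  # $1, $2: comma-separated vectors of equal length
  awk -v a="$1" -v b="$2" 'BEGIN {
    n = split(a, x, ",")
    m = split(b, y, ",")
    if (n != m || n == 0) {
      print "error: vectors must be non-empty and of equal length" > "/dev/stderr"
      exit 1
    }
    for (i = 1; i <= n; i++) {
      dot    += x[i] * y[i]   # running dot product
      norm_a += x[i] * x[i]   # squared length of the first vector
      norm_b += y[i] * y[i]   # squared length of the second vector
    }
    if (norm_a == 0 || norm_b == 0) {
      print "error: zero-length vector" > "/dev/stderr"
      exit 1
    }
    printf "%.4f\n", dot / (sqrt(norm_a) * sqrt(norm_b))
  }'
}

query="1,2,2"
candidate_a="2,4,4"    # same direction as the query
candidate_b="2,-1,0"   # orthogonal to the query

cosine_similarity "$query" "$candidate_a"   # prints 1.0000
cosine_similarity "$query" "$candidate_b"   # prints 0.0000
```

With real embeddings, a source sentence would be scored against each candidate sentence in the same way, and the candidate with the highest score treated as the closest match.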