view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 ā¢ 187
view post Post 2320 š¢ If you wish to empower LLM with IR and named entity recognition module, then I got relevant findings. Just tested Flair below is how you can start for adapting for processing your CSV / JSONL data via bulk-nerš©āš» code: https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/ner_flair_0151.shš¤ models: https://huggingface.co/flairProvider: https://raw.githubusercontent.com/nicolay-r/nlp-thirdgate/refs/heads/master/ner/flair_0151.pyFramework: https://github.com/nicolay-r/bulk-nerš Performance: the default ner model (Thinkpad X1 Nano)Batch-size 1 6it/secBatch-size 10+ 12it/secš other wrappers for bulk-ner nlp-thirdgate: https://github.com/nicolay-r/nlp-thirdgate See translation š 6 6 + Reply
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity ā¢ Updated Nov 1, 2024 ā¢ 96.4M ā¢ ā¢ 3.02k