sbert-compat
#3
by taeminlee - opened
- 1_Pooling/config.json +0 -10
- 2_Normalize/__init__ +0 -0
- README.md +11 -110
- config_sentence_transformers.json +0 -9
- modules.json +0 -20
1_Pooling/config.json
DELETED
@@ -1,10 +0,0 @@
-{
-  "word_embedding_dimension": 1024,
-  "pooling_mode_cls_token": false,
-  "pooling_mode_mean_tokens": true,
-  "pooling_mode_max_tokens": false,
-  "pooling_mode_mean_sqrt_len_tokens": false,
-  "pooling_mode_weightedmean_tokens": false,
-  "pooling_mode_lasttoken": false,
-  "include_prompt": true
-}
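
This config selects mean pooling over the model's 1024-dimensional token embeddings (only `pooling_mode_mean_tokens` is true); it is what tells sentence-transformers how to collapse per-token outputs into a single sentence vector. A minimal sketch of the equivalent operation in plain transformers — the masked average is the standard mean-pooling formula, and the `query: ` prefix is assumed from the E5 convention of the base model:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("nlpai-lab/KoE5")
model = AutoModel.from_pretrained("nlpai-lab/KoE5")

batch = tokenizer(["query: Which city is the capital of Korea?"],
                  padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    token_embeddings = model(**batch).last_hidden_state  # (batch, seq_len, 1024)

# pooling_mode_mean_tokens: average the token embeddings, skipping padding positions.
mask = batch["attention_mask"].unsqueeze(-1).float()
sentence_embeddings = (token_embeddings * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1e-9)
```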
2_Normalize/__init__
DELETED
File without changes
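
`sentence_transformers.models.Normalize` takes no configuration of its own, which is why this directory holds only an empty marker file. The module simply L2-normalizes the pooled vector so that dot product and cosine similarity coincide; continuing the sketch above:

```python
import torch.nn.functional as F

# What the 2_Normalize stage applies: unit-length (L2-normalized) embeddings.
sentence_embeddings = F.normalize(sentence_embeddings, p=2, dim=1)
```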
README.md
CHANGED
@@ -14,18 +14,13 @@ pipeline_tag: feature-extraction
# 🔎 KoE5

Introducing KoE5, a model with advanced retrieval abilities.
-It has shown remarkable performance in Korean text retrieval.
+It has shown remarkable performance in Korean text retrieval, specifically outperforming most multilingual embedding models.
+To our knowledge, it is one of the best publicly available Korean retrieval models.

-For details, visit the [
+For details, visit the [KoE5 repository](https://github.com/nlpai-lab/KoE5)

---

-## Model Versions
-| Model Name | Dimension | Sequence Length | Introduction |
-|:----:|:---:|:---:|:---:|
-| [KURE-v1](https://huggingface.co/nlpai-lab/KURE-v1) | 1024 | 8192 | Fine-tuned [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) with Korean data via [CachedGISTEmbedLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedgistembedloss) |
-| [KoE5](https://huggingface.co/nlpai-lab/KoE5) | 1024 | 512 | Fine-tuned [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) with [ko-triplet-v1.0](https://huggingface.co/datasets/nlpai-lab/ko-triplet-v1.0) via [CachedMultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) |
-
### Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub.

@@ -86,105 +81,17 @@ print(similarities)

## Evaluation
### Metrics
--
+- NDCG@1, F1@1, NDCG@3, F1@3
### Benchmark Datasets
--
--
--
-- [PublicHealthQA](https://huggingface.co/datasets/xhluca/publichealth-qa): Korean document retrieval dataset for the medical and public health domains
-- [BelebeleRetrieval](https://huggingface.co/datasets/facebook/belebele): Korean document retrieval dataset based on FLORES-200
-- [MrTidyRetrieval](https://huggingface.co/datasets/mteb/mrtidy): Korean document retrieval dataset based on Wikipedia
-- [MultiLongDocRetrieval](https://huggingface.co/datasets/Shitao/MLDR): Korean long-document retrieval dataset across various domains
-- [XPQARetrieval](https://huggingface.co/datasets/jinaai/xpqa): Korean document retrieval dataset across various domains
+- Ko-StrategyQA
+- AutoRAG-benchmark
+- PublicHealthQA

## Results
-
-
-
-### Top-k 1
-| Model | Average Recall_top1 | Average Precision_top1 | Average NDCG_top1 | Average F1_top1 |
-|-----------------------------------------|----------------------|------------------------|-------------------|-----------------|
-| **nlpai-lab/KURE-v1** | **0.52640** | **0.60551** | **0.60551** | **0.55784** |
-| dragonkue/BGE-m3-ko | 0.52361 | 0.60394 | 0.60394 | 0.55535 |
-| BAAI/bge-m3 | 0.51778 | 0.59846 | 0.59846 | 0.54998 |
-| Snowflake/snowflake-arctic-embed-l-v2.0 | 0.51246 | 0.59384 | 0.59384 | 0.54489 |
-| nlpai-lab/KoE5 | 0.50157 | 0.57790 | 0.57790 | 0.53178 |
-| intfloat/multilingual-e5-large | 0.50052 | 0.57727 | 0.57727 | 0.53122 |
-| jinaai/jina-embeddings-v3 | 0.48287 | 0.56068 | 0.56068 | 0.51361 |
-| BAAI/bge-multilingual-gemma2 | 0.47904 | 0.55472 | 0.55472 | 0.50916 |
-| intfloat/multilingual-e5-large-instruct | 0.47842 | 0.55435 | 0.55435 | 0.50826 |
-| intfloat/multilingual-e5-base | 0.46950 | 0.54490 | 0.54490 | 0.49947 |
-| intfloat/e5-mistral-7b-instruct | 0.46772 | 0.54394 | 0.54394 | 0.49781 |
-| Alibaba-NLP/gte-multilingual-base | 0.46469 | 0.53744 | 0.53744 | 0.49353 |
-| Alibaba-NLP/gte-Qwen2-7B-instruct | 0.46633 | 0.53625 | 0.53625 | 0.49429 |
-| openai/text-embedding-3-large | 0.44884 | 0.51688 | 0.51688 | 0.47572 |
-| Salesforce/SFR-Embedding-2_R | 0.43748 | 0.50815 | 0.50815 | 0.46504 |
-| upskyy/bge-m3-korean | 0.43125 | 0.50245 | 0.50245 | 0.45945 |
-| jhgan/ko-sroberta-multitask | 0.33788 | 0.38497 | 0.38497 | 0.35678 |
-
-### Top-k 3
-| Model | Average Recall_top3 | Average Precision_top3 | Average NDCG_top3 | Average F1_top3 |
-|-----------------------------------------|----------------------|------------------------|-------------------|-----------------|
-| **nlpai-lab/KURE-v1** | **0.68678** | **0.28711** | **0.65538** | **0.39835** |
-| dragonkue/BGE-m3-ko | 0.67834 | 0.28385 | 0.64950 | 0.39378 |
-| BAAI/bge-m3 | 0.67526 | 0.28374 | 0.64556 | 0.39291 |
-| Snowflake/snowflake-arctic-embed-l-v2.0 | 0.67128 | 0.28193 | 0.64042 | 0.39072 |
-| intfloat/multilingual-e5-large | 0.65807 | 0.27777 | 0.62822 | 0.38423 |
-| nlpai-lab/KoE5 | 0.65174 | 0.27329 | 0.62369 | 0.37882 |
-| BAAI/bge-multilingual-gemma2 | 0.64415 | 0.27416 | 0.61105 | 0.37782 |
-| jinaai/jina-embeddings-v3 | 0.64116 | 0.27165 | 0.60954 | 0.37511 |
-| intfloat/multilingual-e5-large-instruct | 0.64353 | 0.27040 | 0.60790 | 0.37453 |
-| Alibaba-NLP/gte-multilingual-base | 0.63744 | 0.26404 | 0.59695 | 0.36764 |
-| Alibaba-NLP/gte-Qwen2-7B-instruct | 0.63163 | 0.25937 | 0.59237 | 0.36263 |
-| intfloat/multilingual-e5-base | 0.62099 | 0.26144 | 0.59179 | 0.36203 |
-| intfloat/e5-mistral-7b-instruct | 0.62087 | 0.26144 | 0.58917 | 0.36188 |
-| openai/text-embedding-3-large | 0.61035 | 0.25356 | 0.57329 | 0.35270 |
-| Salesforce/SFR-Embedding-2_R | 0.60001 | 0.25253 | 0.56346 | 0.34952 |
-| upskyy/bge-m3-korean | 0.59215 | 0.25076 | 0.55722 | 0.34623 |
-| jhgan/ko-sroberta-multitask | 0.46930 | 0.18994 | 0.43293 | 0.26696 |
-
-### Top-k 5
-| Model | Average Recall_top5 | Average Precision_top5 | Average NDCG_top5 | Average F1_top5 |
-|-----------------------------------------|----------------------|------------------------|-------------------|-----------------|
-| **nlpai-lab/KURE-v1** | **0.73851** | **0.19130** | **0.67479** | **0.29903** |
-| dragonkue/BGE-m3-ko | 0.72517 | 0.18799 | 0.66692 | 0.29401 |
-| BAAI/bge-m3 | 0.72954 | 0.18975 | 0.66615 | 0.29632 |
-| Snowflake/snowflake-arctic-embed-l-v2.0 | 0.72962 | 0.18875 | 0.66236 | 0.29542 |
-| nlpai-lab/KoE5 | 0.70820 | 0.18287 | 0.64499 | 0.28628 |
-| intfloat/multilingual-e5-large | 0.70124 | 0.18316 | 0.64402 | 0.28588 |
-| BAAI/bge-multilingual-gemma2 | 0.70258 | 0.18556 | 0.63338 | 0.28851 |
-| jinaai/jina-embeddings-v3 | 0.69933 | 0.18256 | 0.63133 | 0.28505 |
-| intfloat/multilingual-e5-large-instruct | 0.69018 | 0.17838 | 0.62486 | 0.27933 |
-| Alibaba-NLP/gte-multilingual-base | 0.69365 | 0.17789 | 0.61896 | 0.27879 |
-| intfloat/multilingual-e5-base | 0.67250 | 0.17406 | 0.61119 | 0.27247 |
-| Alibaba-NLP/gte-Qwen2-7B-instruct | 0.67447 | 0.17114 | 0.60952 | 0.26943 |
-| intfloat/e5-mistral-7b-instruct | 0.67449 | 0.17484 | 0.60935 | 0.27349 |
-| openai/text-embedding-3-large | 0.66365 | 0.17004 | 0.59389 | 0.26677 |
-| Salesforce/SFR-Embedding-2_R | 0.65622 | 0.17018 | 0.58494 | 0.26612 |
-| upskyy/bge-m3-korean | 0.65477 | 0.17015 | 0.58073 | 0.26589 |
-| jhgan/ko-sroberta-multitask | 0.53136 | 0.13264 | 0.45879 | 0.20976 |
-
-### Top-k 10
-| Model | Average Recall_top10 | Average Precision_top10 | Average NDCG_top10 | Average F1_top10 |
-|-----------------------------------------|----------------------|------------------------|-------------------|-----------------|
-| **nlpai-lab/KURE-v1** | **0.79682** | **0.10624** | **0.69473** | **0.18524** |
-| dragonkue/BGE-m3-ko | 0.78450 | 0.10492 | 0.68748 | 0.18288 |
-| BAAI/bge-m3 | 0.79195 | 0.10592 | 0.68723 | 0.18456 |
-| Snowflake/snowflake-arctic-embed-l-v2.0 | 0.78669 | 0.10462 | 0.68189 | 0.18260 |
-| intfloat/multilingual-e5-large | 0.75902 | 0.10147 | 0.66370 | 0.17693 |
-| nlpai-lab/KoE5 | 0.75296 | 0.09937 | 0.66012 | 0.17369 |
-| BAAI/bge-multilingual-gemma2 | 0.76153 | 0.10364 | 0.65330 | 0.18003 |
-| jinaai/jina-embeddings-v3 | 0.76277 | 0.10240 | 0.65290 | 0.17843 |
-| intfloat/multilingual-e5-large-instruct | 0.74851 | 0.09888 | 0.64451 | 0.17283 |
-| Alibaba-NLP/gte-multilingual-base | 0.75631 | 0.09938 | 0.64025 | 0.17363 |
-| Alibaba-NLP/gte-Qwen2-7B-instruct | 0.74092 | 0.09607 | 0.63258 | 0.16847 |
-| intfloat/multilingual-e5-base | 0.73512 | 0.09717 | 0.63216 | 0.16977 |
-| intfloat/e5-mistral-7b-instruct | 0.73795 | 0.09777 | 0.63076 | 0.17078 |
-| openai/text-embedding-3-large | 0.72946 | 0.09571 | 0.61670 | 0.16739 |
-| Salesforce/SFR-Embedding-2_R | 0.71662 | 0.09546 | 0.60589 | 0.16651 |
-| upskyy/bge-m3-korean | 0.71895 | 0.09583 | 0.60258 | 0.16712 |
-| jhgan/ko-sroberta-multitask | 0.61225 | 0.07826 | 0.48687 | 0.13757 |
-<br/>
+- By datasets
+<img src="KoE5-results-by-datasets.png" width="100%">
+- Average
+<img src="KoE5-results-avg.png" width="100%">

## FAQ

@@ -203,12 +110,6 @@ Here are some rules of thumb:

If you find our paper or models helpful, please consider citing as follows:
```text
-@misc{KURE,
-  publisher = {Youngjoon Jang, Junyoung Son, Taemin Lee},
-  year = {2024},
-  url = {https://github.com/nlpai-lab/KURE}
-},
-
@misc{KoE5,
  author = {NLP & AI Lab and Human-Inspired AI research},
  title = {KoE5: A New Dataset and Model for Improving Korean Embedding Performance},
config_sentence_transformers.json
DELETED
@@ -1,9 +0,0 @@
-{
-  "__version__": {
-    "sentence_transformers": "2.6.1",
-    "transformers": "4.39.3",
-    "pytorch": "2.2.2"
-  },
-  "prompts": {},
-  "default_prompt_name": null
-}
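
Since `prompts` is empty and `default_prompt_name` is null here, the library adds no prefix automatically. Assuming KoE5 keeps the E5 prefix convention of its base model (the README's usage example is the authority on this), callers prepend `query: ` / `passage: ` themselves:

```python
from sentence_transformers import SentenceTransformer

# When these sentence-transformers files are present, the repo loads directly.
model = SentenceTransformer("nlpai-lab/KoE5")
q_emb = model.encode(["query: Which city is the capital of Korea?"])
p_emb = model.encode(["passage: Seoul is the capital of Korea."])
```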
modules.json
DELETED
@@ -1,20 +0,0 @@
-[
-  {
-    "idx": 0,
-    "name": "0",
-    "path": "",
-    "type": "sentence_transformers.models.Transformer"
-  },
-  {
-    "idx": 1,
-    "name": "1",
-    "path": "1_Pooling",
-    "type": "sentence_transformers.models.Pooling"
-  },
-  {
-    "idx": 2,
-    "name": "2",
-    "path": "2_Normalize",
-    "type": "sentence_transformers.models.Normalize"
-  }
-]
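
modules.json is what chains the three stages together: Transformer → Pooling → Normalize. A sketch of the same pipeline assembled by hand, assuming the deleted configs above are the ones in effect:

```python
from sentence_transformers import SentenceTransformer, models

word = models.Transformer("nlpai-lab/KoE5")  # idx 0, path "" (the repo root)
pool = models.Pooling(word.get_word_embedding_dimension(),  # 1024, per 1_Pooling/config.json
                      pooling_mode="mean")
norm = models.Normalize()  # idx 2, path "2_Normalize"

model = SentenceTransformer(modules=[word, pool, norm])
embeddings = model.encode(["query: Which city is the capital of Korea?"])  # unit-length vectors
```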