Update README.md
Browse files
README.md
CHANGED
@@ -1,23 +1,31 @@
|
|
1 |
---
|
2 |
-
license: apache-2.0
|
3 |
-
language:
|
4 |
-
- en
|
5 |
-
pipeline_tag: feature-extraction
|
6 |
tags:
|
7 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
---
|
9 |
-
|
10 |
<br><br>
|
11 |
|
12 |
<p align="center">
|
13 |
-
<img src="https://
|
14 |
</p>
|
15 |
|
16 |
|
17 |
<p align="center">
|
18 |
-
<b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a
|
19 |
</p>
|
20 |
|
|
|
|
|
|
|
|
|
21 |
|
22 |
## Intended Usage & Model Info
|
23 |
|
@@ -35,17 +43,11 @@ This makes our model useful for a range of use cases, especially when processing
|
|
35 |
This model has 137 million parameters, which enables fast and memory efficient inference, while delivering impressive performance.
|
36 |
Additionally, we provide the following embedding models:
|
37 |
|
38 |
-
**V1 (Based on T5, 512 Seq)**
|
39 |
-
|
40 |
-
- [`jina-embeddings-v1-small-en`](https://huggingface.co/jinaai/jina-embedding-s-en-v1): 35 million parameters.
|
41 |
-
- [`jina-embeddings-v1-base-en`](https://huggingface.co/jinaai/jina-embedding-b-en-v1): 110 million parameters.
|
42 |
-
- [`jina-embeddings-v1-large-en`](https://huggingface.co/jinaai/jina-embedding-l-en-v1): 330 million parameters.
|
43 |
-
|
44 |
-
**V2 (Based on JinaBert, 8k Seq)**
|
45 |
-
|
46 |
- [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
|
47 |
-
- [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters
|
48 |
-
- [`jina-embeddings-v2-base-
|
|
|
|
|
49 |
|
50 |
**<details><summary>Supported (Programming) Languages</summary>**
|
51 |
<p>
|
|
|
1 |
---
|
|
|
|
|
|
|
|
|
2 |
tags:
|
3 |
+
- sentence-transformers
|
4 |
+
- feature-extraction
|
5 |
+
- sentence-similarity
|
6 |
+
- mteb
|
7 |
+
datasets:
|
8 |
+
- allenai/c4
|
9 |
+
language: en
|
10 |
+
inference: false
|
11 |
+
license: apache-2.0
|
12 |
---
|
13 |
+
<!-- TODO: add evaluation results here -->
|
14 |
<br><br>
|
15 |
|
16 |
<p align="center">
|
17 |
+
<img src="https://aeiljuispo.cloudimg.io/v7/https://cdn-uploads.huggingface.co/production/uploads/603763514de52ff951d89793/AFoybzd5lpBQXEBrQHuTt.png?w=200&h=200&f=face" alt="Finetuner logo: Finetuner helps you to create experiments in order to improve embeddings on search tasks. It accompanies you to deliver the last mile of performance-tuning for neural search applications." width="150px">
|
18 |
</p>
|
19 |
|
20 |
|
21 |
<p align="center">
|
22 |
+
<b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a>.</b>
|
23 |
</p>
|
24 |
|
25 |
+
## Quick Start
|
26 |
+
|
27 |
+
The easiest way to starting using `jina-embeddings-v2-base-en` is to use Jina AI's [Embedding API](https://jina.ai/embeddings/).
|
28 |
+
|
29 |
|
30 |
## Intended Usage & Model Info
|
31 |
|
|
|
43 |
This model has 137 million parameters, which enables fast and memory efficient inference, while delivering impressive performance.
|
44 |
Additionally, we provide the following embedding models:
|
45 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
46 |
- [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
|
47 |
+
- [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters **(you are here)**.
|
48 |
+
- [`jina-embeddings-v2-base-zh`](https://huggingface.co/jinaai/jina-embeddings-v2-base-zh): Chinese-English Bilingual embeddings.
|
49 |
+
- [`jina-embeddings-v2-base-de`](https://huggingface.co/jinaai/jina-embeddings-v2-base-de): German-English Bilingual embeddings.
|
50 |
+
- [`jina-embeddings-v2-base-es`](https://huggingface.co/jinaai/jina-embeddings-v2-base-es): Spanish-English Bilingual embeddings (soon).
|
51 |
|
52 |
**<details><summary>Supported (Programming) Languages</summary>**
|
53 |
<p>
|