sangdv commited on
Commit
dfb7bc5
1 Parent(s): bc7a1e5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -25,9 +25,10 @@ license: apache-2.0
25
  # bkai-foundation-models/vietnamese-bi-encoder
26
 
27
  This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
 
28
  We train the model on a merged training dataset that consists of:
29
- - MS Macro (translated in Vietnamese)
30
- - SQuAD v2 (translated in Vietnamese)
31
  - 80% of the training set from the Legal Text Retrieval Zalo 2021 challenge
32
 
33
  We use [phobert-base-v2](https://github.com/VinAIResearch/PhoBERT) as the pre-trained backbone.
 
25
  # bkai-foundation-models/vietnamese-bi-encoder
26
 
27
  This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
28
+
29
  We train the model on a merged training dataset that consists of:
30
+ - MS Macro (translated into Vietnamese)
31
+ - SQuAD v2 (translated into Vietnamese)
32
  - 80% of the training set from the Legal Text Retrieval Zalo 2021 challenge
33
 
34
  We use [phobert-base-v2](https://github.com/VinAIResearch/PhoBERT) as the pre-trained backbone.