Sigurdur commited on
Commit
dd3d489
1 Parent(s): 799f585

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -4
README.md CHANGED
@@ -5,14 +5,24 @@ tags:
5
  - feature-extraction
6
  - sentence-similarity
7
  - transformers
8
-
 
9
  ---
10
 
11
- # {MODEL_NAME}
12
 
13
  This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
14
 
15
- <!--- Describe your model here -->
 
 
 
 
 
 
 
 
 
16
 
17
  ## Usage (Sentence-Transformers)
18
 
@@ -120,4 +130,5 @@ SentenceTransformer(
120
 
121
  ## Citing & Authors
122
 
123
- <!--- Describe where people can find more information -->
 
 
5
  - feature-extraction
6
  - sentence-similarity
7
  - transformers
8
+ language:
9
+ - is
10
  ---
11
 
12
+ # Icelandic SBERT for Sentence Embedding
13
 
14
  This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
15
 
16
+ ## Data
17
+
18
+ The model was trained on 300 000 sentences, selected at random from clarin-is: [unanotated news2 from IGC(RMH)](https://repository.clarin.is/repository/xmlui/handle/20.500.12537/238)
19
+
20
+
21
+ to install the data, run the following command:
22
+
23
+ ```bash
24
+ curl --remote-name-all https://repository.clarin.is/repository/xmlui/bitstream/handle/20.500.12537/238{/IGC-News2-22.10.TEI.zip}
25
+ ```
26
 
27
  ## Usage (Sentence-Transformers)
28
 
 
130
 
131
  ## Citing & Authors
132
 
133
+ <!--- Describe where people can find more information -->
134
+ Sigurdur Haukur Birgisson