lbourdois committed on
Commit
24f41da
1 Parent(s): b1ae634

Update README.md

  1. README.md +24 -11
README.md CHANGED
@@ -1,19 +1,32 @@
  ---
- tags:
- - word2vec
- language: ar
  license: apache-2.0
  ---

- ## Citation Information

  ```
- @inproceedings{yamada2016joint,
- title={Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation},
- author={Yamada, Ikuya and Shindo, Hiroyuki and Takeda, Hideaki and Takefuji, Yoshiyasu},
- booktitle={Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning},
- year={2016},
- publisher={Association for Computational Linguistics},
- pages={250--259}
  }
  ```
 
  ---
  license: apache-2.0
+ tags:
+ - word2vec
+ datasets:
+ - wikipedia
+ language:
+ - ar
  ---

+ ## Information
+ Pretrained Word2vec in Arabic. For more information, see [https://wikipedia2vec.github.io/wikipedia2vec/pretrained/](https://wikipedia2vec.github.io/wikipedia2vec/pretrained/).

+ ## How to use?
  ```
+ from gensim.models import KeyedVectors
+ from huggingface_hub import hf_hub_download
+ model = KeyedVectors.load_word2vec_format(hf_hub_download(repo_id="Word2vec/wikipedia2vec_arwiki_20180420_100d", filename="arwiki_20180420_100d.txt"))
+ model.most_similar("your_word")
+ ```
+
+ ## Citation
+ ```
+ @inproceedings{yamada2020wikipedia2vec,
+ title = "{W}ikipedia2{V}ec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from {W}ikipedia",
+ author={Yamada, Ikuya and Asai, Akari and Sakuma, Jin and Shindo, Hiroyuki and Takeda, Hideaki and Takefuji, Yoshiyasu and Matsumoto, Yuji},
+ booktitle = {Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
+ year = {2020},
+ publisher = {Association for Computational Linguistics},
+ pages = {23--30}
  }
  ```
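
Note on the "How to use?" snippet added in this commit: it loads a plain-text word2vec file and runs a nearest-neighbour query. The sketch below illustrates, with the standard library only, what those two steps roughly amount to. It is not gensim's implementation. The helper names (`load_word2vec_text`, `cosine`, `most_similar`) and the toy vectors are hypothetical, and it assumes the usual word2vec text layout: a `count dim` header line followed by one `token v1 ... v_dim` row per entry.

```python
import io
import math


def load_word2vec_text(stream):
    """Parse word2vec text format into a dict mapping token -> list of floats."""
    # Header line: "<number of vectors> <dimension>".
    _count, dim = map(int, stream.readline().split())
    vectors = {}
    for line in stream:
        # Split from the right so a token containing spaces stays intact.
        parts = line.rstrip().rsplit(" ", dim)
        if len(parts) != dim + 1:
            continue  # skip malformed or empty rows
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(vectors, word, topn=3):
    """Return up to topn (token, similarity) pairs, most similar first."""
    if word not in vectors:
        return []  # this sketch returns an empty list for unknown words
    query = vectors[word]
    scored = [(other, cosine(query, vec))
              for other, vec in vectors.items() if other != word]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:topn]


# Toy data standing in for the downloaded .txt file (illustrative values only).
TOY = "3 2\nking 1.0 0.0\nqueen 0.9 0.1\napple 0.0 1.0\n"
vectors = load_word2vec_text(io.StringIO(TOY))
print(most_similar(vectors, "king", topn=1))
```

One difference worth knowing: gensim's `most_similar` raises `KeyError` for a word that is not in the vocabulary, so real code may want to check `"your_word" in model` before querying.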