Word2vec
/

wikipedia2vec_arwiki_20180420_100d

Model card Files Files and versions Community

lbourdois commited on Jul 8, 2023

Commit

24f41da

•

1 Parent(s): b1ae634

Update README.md

Files changed (1) hide show

README.md +24 -11

README.md CHANGED Viewed

@@ -1,19 +1,32 @@
 ---
-tags:
-- word2vec
-language: ar
 license: apache-2.0
 ---
-## Citation Information
 ```
-@inproceedings{yamada2016joint,
-  title={Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation},
-  author={Yamada, Ikuya and Shindo, Hiroyuki and Takeda, Hideaki and Takefuji, Yoshiyasu},
-  booktitle={Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning},
-  year={2016},
-  publisher={Association for Computational Linguistics},
-  pages={250--259}
 }
 ```

 ---
 license: apache-2.0
+tags:
+  - word2vec
+datasets:
+- wikipedia
+language:
+- ar
 ---
+## Information
+Pretrained Word2vec in Arabic. For more information, see [https://wikipedia2vec.github.io/wikipedia2vec/pretrained/](https://wikipedia2vec.github.io/wikipedia2vec/pretrained/).
+## How to use?
 ```
+from gensim.models import KeyedVectors
+from huggingface_hub import hf_hub_download
+model = KeyedVectors.load_word2vec_format(hf_hub_download(repo_id="Word2vec/wikipedia2vec_arwiki_20180420_100d", filename="arwiki_20180420_100d.txt"))
+model.most_similar("your_word")
+```
+## Citation
+```
+@inproceedings{yamada2020wikipedia2vec,
+  title = "{W}ikipedia2{V}ec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from {W}ikipedia",
+  author={Yamada, Ikuya and Asai, Akari and Sakuma, Jin and Shindo, Hiroyuki and Takeda, Hideaki and Takefuji, Yoshiyasu and Matsumoto, Yuji},
+  booktitle = {Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations},
+  year = {2020},
+  publisher = {Association for Computational Linguistics},
+  pages = {23--30}
 }
 ```