studio-ousia/luke-japanese-large-lite

ikuyamada committed on Nov 9, 2022

Commit 2371c49 • 1 Parent(s): bda8a52

update README

Files changed (1)

README.md +56 -0


@@ -1,3 +1,59 @@
 ---
+language: ja
+thumbnail: https://github.com/studio-ousia/luke/raw/master/resources/luke_logo.png
+tags:
+  - luke
+  - named entity recognition
+  - entity typing
+  - relation classification
+  - question answering
 license: apache-2.0
 ---
+## luke-japanese-large-lite
+**luke-japanese** is the Japanese version of **LUKE** (**L**anguage
+**U**nderstanding with **K**nowledge-based **E**mbeddings), a pre-trained
+_knowledge-enhanced_ contextualized representation of words and entities. LUKE
+treats words and entities in a given text as independent tokens, and outputs
+contextualized representations of them. Please refer to our
+[GitHub repository](https://github.com/studio-ousia/luke) for more details and
+updates.
+This model is a lightweight version which does not contain Wikipedia entity
+embeddings. Please use the
+[full version](https://huggingface.co/studio-ousia/luke-japanese-large/) for
+tasks that use Wikipedia entities as inputs.
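+**luke-japanese** is the Japanese version of **LUKE**, a knowledge-enhanced pre-trained Transformer model of words and entities. LUKE treats words and entities as independent tokens and outputs contextualized representations of them. For details, see the [GitHub repository](https://github.com/studio-ousia/luke).
+This model is a lightweight version that does not include Wikipedia entity embeddings. For tasks that take Wikipedia entities as input, use the [full version](https://huggingface.co/studio-ousia/luke-japanese-large/).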
+### Experimental results on JGLUE
+The results on the dev set of [JGLUE](https://github.com/yahoojapan/JGLUE)
+are shown below:
+| Model                         | MARC-ja   | JSTS                | JNLI      | JCommonsenseQA |
+| ----------------------------- | --------- | ------------------- | --------- | -------------- |
+|                               | acc       | Pearson/Spearman    | acc       | acc            |
+| **LUKE Japanese large**       | **0.965** | **0.932**/**0.902** | **0.927** | 0.893          |
+| _Baselines:_                  |           |                     |           |                |
+| Tohoku BERT large             | 0.955     | 0.913/0.872         | 0.900     | 0.816          |
+| Waseda RoBERTa large (seq128) | 0.954     | 0.930/0.896         | 0.924     | **0.907**      |
+| Waseda RoBERTa large (seq512) | 0.961     | 0.926/0.892         | 0.926     | 0.891          |
+| XLM RoBERTa large             | 0.964     | 0.918/0.884         | 0.919     | 0.840          |
+The baseline scores are taken from the
+[JGLUE README](https://github.com/yahoojapan/JGLUE/blob/a6832af23895d6faec8ecf39ec925f1a91601d62/README.md).
+### Citation
+```bibtex
+@inproceedings{yamada2020luke,
+  title={LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention},
+  author={Ikuya Yamada and Akari Asai and Hiroyuki Shindo and Hideaki Takeda and Yuji Matsumoto},
+  booktitle={EMNLP},
+  year={2020}
+}
+```
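
The JGLUE table in the README reports accuracy (acc) for MARC-ja, JNLI, and JCommonsenseQA, and Pearson/Spearman correlation for JSTS. As a reading aid, the sketch below shows how these three metrics are commonly computed. It is not the JGLUE evaluation script, and the function names (`accuracy`, `pearson`, `average_ranks`, `spearman`) and the toy data are illustrative assumptions, not anything taken from the commit.

```python
from math import sqrt


def accuracy(gold, pred):
    """Fraction of predictions that exactly match the gold labels."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of spreads."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Made-up similarity scores, only to show the call pattern.
    gold_scores = [0.0, 1.5, 3.0, 4.5]
    pred_scores = [0.2, 1.0, 3.5, 4.0]
    print(pearson(gold_scores, pred_scores), spearman(gold_scores, pred_scores))
```

Pearson measures how well predicted scores follow the gold scores linearly, while Spearman only compares their orderings, which is why the table lists the two values separately for JSTS.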