Mizuiro-sakura/Luke-finetuned-sentiment-analysis

sentiment-analysis


Mizuiro-sakura committed on Dec 27, 2022

Commit 61f41b6 (1 parent: e8b94f5)

Update README.md

Files changed (1)

README.md +31 -0


@@ -6,6 +6,8 @@ license: mit
@@ -19,6 +21,35 @@ LUKE (Language Understanding with Knowledge-based Embeddings) is a new pre-train

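 Texts by Natsume Soseki (Kokoro, Botchan, Sanshiro, etc.) were labeled positive or negative using the Japanese Sentiment Polarity Dictionary
 ( http://www.cl.ecei.tohoku.ac.jp/Open_Resources-Japanese_Sentiment_Polarity_Dictionary.html )
 and the model was trained on the resulting labeled data.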
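+The model achieves high accuracy on relatively long texts (30 words or more). (Accuracy has been confirmed to be low on short inputs such as single words.)
+Given the training data used, accuracy is also expected to be higher on written (literary) language than on colloquial language.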
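The dictionary-based labeling described above can be illustrated with a minimal sketch. The exact procedure used to build the training data is not given here, so everything below is an assumption: `TOY_LEXICON` is a hypothetical stand-in for the real dictionary, `label_sentence` is an illustrative helper name, the input is assumed to be already tokenized, and summing word polarities is just one simple way to turn dictionary hits into a sentence label.

```python
from typing import Dict, List, Optional

# Hypothetical toy lexicon: word -> polarity (+1 positive, -1 negative).
# It only stands in for the real dictionary, whose format is not reproduced here.
TOY_LEXICON: Dict[str, float] = {
    "嬉しい": 1.0,
    "美しい": 1.0,
    "悲しい": -1.0,
    "苦しい": -1.0,
}


def label_sentence(tokens: List[str], lexicon: Dict[str, float]) -> Optional[int]:
    """Return 1 (positive), 0 (negative), or None if polarity is undecided."""
    # Sum the polarity of every token found in the lexicon; unknown tokens count as 0.
    score = sum(lexicon.get(tok, 0.0) for tok in tokens)
    if score > 0:
        return 1
    if score < 0:
        return 0
    return None  # no lexicon hit, or a tie: skip rather than guess


if __name__ == "__main__":
    # Tokens are assumed to come from a Japanese tokenizer (not shown).
    print(label_sentence(["今日", "は", "嬉しい"], TOY_LEXICON))
```

Sentences that return `None` would simply be left out of the training set in this sketch.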
 # This model is based on Luke-japanese-base-lite
 This model was fine-tuned from studio-ousia/Luke-japanese-base-lite.
 LUKE achieves state-of-the-art results on five popular NLP benchmarks including SQuAD v1.1 (extractive question answering), CoNLL-2003 (named entity recognition), ReCoRD (cloze-style question answering), TACRED (relation classification), and Open Entity (entity typing).
 # How to use
+-------------------------------------------------------------
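+import torch
+from torch import nn
+from transformers import MLukeTokenizer
+# Tokenizer of the base model
+tokenizer = MLukeTokenizer.from_pretrained('studio-ousia/luke-japanese-base-lite')
+# Loads the whole pickled model object; newer PyTorch releases may need weights_only=False
+model = torch.load('C:\\[directory containing My_luke_model_pn.pth]\\My_luke_model_pn.pth')
+model.eval()  # inference mode (disables dropout)
+text = input()
+encoded_dict = tokenizer.encode_plus(
+    text,
+    return_attention_mask=True,  # create the attention mask
+    return_tensors='pt',         # return PyTorch tensors
+)
+with torch.no_grad():  # no gradients are needed for prediction
+    pre = model(encoded_dict['input_ids'], token_type_ids=None, attention_mask=encoded_dict['attention_mask'])
+# Softmax over the two class logits; index 1 is the positive class
+SOFTMAX = nn.Softmax(dim=0)
+num = SOFTMAX(pre.logits[0])
+print(str(num[1]))  # probability of the positive class
+if num[1] > 0.5:
+    print('ポジティブ')  # positive
+else:
+    print('ネガティブ')  # negative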
+-------------------------------------------------------------
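The final decision step of the snippet above (softmax over the logits, then a 0.5 threshold on index 1) can be tried without the model file. The following is a minimal sketch in plain Python; `classify` is a hypothetical helper name, and the English labels and the two-logit input with index 1 as the positive class are assumptions taken from how the snippet reads `num[1]`.

```python
import math
from typing import Sequence, Tuple


def classify(logits: Sequence[float], threshold: float = 0.5) -> Tuple[str, float]:
    """Map two class logits to a label and the positive-class probability."""
    # Numerically stable softmax: subtract the max before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    p_positive = exps[1] / sum(exps)  # index 1 is assumed to be the positive class
    label = "positive" if p_positive > threshold else "negative"
    return label, p_positive


if __name__ == "__main__":
    label, prob = classify([-1.0, 2.0])
    print(label, round(prob, 4))
```

As in the original snippet, a probability of exactly 0.5 falls on the negative side because the comparison is strict.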
 # Citation
 [1]@inproceedings{yamada2020luke,