aixsatoshi
/

Honyaku-7b-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

aixsatoshi commited on Apr 8

Commit

764ca7f

•

1 Parent(s): 311cc5a

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ Honyaku-7b-v2 is an improved version of its predecessor. This model exhibits enh
 * Improved Multilingual Generation Accuracy: The model has increased precision in following multilingual generation tags.
 * Quality-Reflective Translation: The translation quality of Honyaku-7b is strongly influenced by the pre-training of the base model. Consequently, the quality of translation varies in proportion to the training volume of the original language model.
-* The primary purpose is to translate several hundreds to several thousand tokens. Due to the characteristics of the Base model, translation into Japanese is the most stable.
 * It has been fine-tuned up to 8k tokens, but based on the Base model's characteristics, it supports up to 4k tokens including the prompt.
 **Cautions:**
@@ -26,7 +26,8 @@ Honyaku-7b-v2は、前バージョンの改良版です。このモデルは、
 * 多言語生成の精度向上： モデルは、多言語生成タグに対する追従の精度が向上しました。
 * 翻訳品質の反映： Honyaku-7bの翻訳品質は、ベースモデルの事前学習に強く影響されます。翻訳品質は、元の言語モデルの学習量に比例して変わります。
-* 数100～数1000 tokenの翻訳を主目的としています。Base modelの特徴から、日本語への翻訳が最も安定しています。
 * 8k tokenまでファインチューニングしていますが、Base modelの特徴からprompt含めて4k tokenにまで対応とします。
 **注意点：**

 * Improved Multilingual Generation Accuracy: The model has increased precision in following multilingual generation tags.
 * Quality-Reflective Translation: The translation quality of Honyaku-7b is strongly influenced by the pre-training of the base model. Consequently, the quality of translation varies in proportion to the training volume of the original language model.
+* The primary purpose is to translate about 500 to several thousand tokens. Due to the characteristics of the Base model, translation into Japanese is the most stable.
 * It has been fine-tuned up to 8k tokens, but based on the Base model's characteristics, it supports up to 4k tokens including the prompt.
 **Cautions:**
 * 多言語生成の精度向上： モデルは、多言語生成タグに対する追従の精度が向上しました。
 * 翻訳品質の反映： Honyaku-7bの翻訳品質は、ベースモデルの事前学習に強く影響されます。翻訳品質は、元の言語モデルの学習量に比例して変わります。
+* 500～数1000 tokenの翻訳を主目的としています。短すぎる文、長すぎる文で性能低下。
+* Base modelの特徴から、日本語への翻訳が最も安定しています。
 * 8k tokenまでファインチューニングしていますが、Base modelの特徴からprompt含めて4k tokenにまで対応とします。
 **注意点：**