Kendamarron/Tokara-0.5B-Chat-v0.1


Kendamarron committed on May 8

Commit 306dc5b • 1 Parent(s): fcaf372

Update README.md

Files changed (1)

README.md +15 -2

@@ -10,7 +10,13 @@ pipeline_tag: text-generation
 ---
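 ## About the model
-This model adds conversational ability, via chatvector, to [Tokara-0.5B-v0.1](), which is [Qwen/Qwen1.5-0.5B](https://huggingface.co/Qwen/Qwen1.5-0.5B) after continued pre-training on 5B tokens of Japanese and English data.
 See [this article](https://zenn.dev/kendama/articles/55564e12da6e82) for details.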
@@ -19,8 +25,15 @@ pipeline_tag: text-generation
 - 0.24*(Qwen/Qwen1.5-0.5B-Chat - Qwen/Qwen1.5-0.5B)
 - 0.56*(Kendamarron/Tokara-0.5B-Chat-dolly-jimba - Kendamarron/Tokara-0.5B-v0.1)
 ## About the name
-The name comes from the Tokara horse (トカラウマ), a horse breed native to Japan.
 ```python
 import torch

 ---
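 ## About the model
+This model adds conversational ability, via chat vector, to [Tokara-0.5B-v0.1](https://huggingface.co/Kendamarron/Tokara-0.5B-v0.1), which is [Qwen/Qwen1.5-0.5B](https://huggingface.co/Qwen/Qwen1.5-0.5B) after continued pre-training on 5B tokens of Japanese and English data.
+For a model of only 0.5B parameters, it communicates reasonably well.
+The models used for the chat vector were trained on multi-turn data, so it should also be able to handle multi-turn conversations.
+Possibly because of the model size, repetition starts early unless repetition_penalty is set to around 1.15 to 1.25.
 See [this article](https://zenn.dev/kendama/articles/55564e12da6e82) for details.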
 - 0.24*(Qwen/Qwen1.5-0.5B-Chat - Qwen/Qwen1.5-0.5B)
 - 0.56*(Kendamarron/Tokara-0.5B-Chat-dolly-jimba - Kendamarron/Tokara-0.5B-v0.1)
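The two weighted terms above can be read as chat vectors added to the base weights. The following is a minimal sketch of that arithmetic, assuming both terms are summed onto the Tokara-0.5B-v0.1 parameters; the helper `add_chat_vectors` and the toy one-parameter "models" are hypothetical stand-ins for illustration and are not taken from the README.

```python
def add_chat_vectors(base, terms):
    """Return base + sum(weight * (chat - pretrained)) for every parameter.

    base:  dict mapping parameter name -> list of floats
    terms: list of (weight, chat_state, pretrained_state) tuples
    """
    merged = {}
    for name, base_values in base.items():
        values = list(base_values)  # copy so the base weights stay untouched
        for weight, chat, pretrained in terms:
            for i, (c, p) in enumerate(zip(chat[name], pretrained[name])):
                values[i] += weight * (c - p)
        merged[name] = values
    return merged


# Toy values standing in for real state dicts (assumption, not real weights).
tokara_base = {"w": [1.0, 2.0]}   # plays the role of Tokara-0.5B-v0.1
qwen_base = {"w": [0.0, 0.0]}     # plays the role of Qwen1.5-0.5B
qwen_chat = {"w": [1.0, -1.0]}    # plays the role of Qwen1.5-0.5B-Chat
tokara_chat = {"w": [2.0, 2.0]}   # plays the role of Tokara-0.5B-Chat-dolly-jimba

merged = add_chat_vectors(
    tokara_base,
    [
        (0.24, qwen_chat, qwen_base),
        (0.56, tokara_chat, tokara_base),
    ],
)
print(merged)
```

With real checkpoints the same per-parameter update would run over tensors rather than Python lists; which layers, if any, are excluded from the merge is not stated in this hunk.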
+## Benchmark
+Single-turn evaluation on six categories of Japanese MT-bench.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63075d83cb09c0a9042a82c2/8Mg54DXeRBFcnF0Xgka68.png)
+| Extraction | Humanities | Reasoning | Roleplay | STEM | Writing |
+| ---------- | ---------- | --------- | -------- | ---- | ------- |
+| 1.3        | 2.6        | 2.5       | 3.8      | 2.3  | 3.2     |
 ## About the name
+Named after the Tokara horse (トカラ馬), a horse breed native to Japan.
 ```python
 import torch