grapevine-AI/Athene-V2-Chat-GGUF

grapevine-AI committed on Nov 17, 2024

Commit 5ab48ce (verified) · 1 Parent(s): 7ad78b9

Update README.md

Files changed (1)

README.md +31 -5

@@ -1,5 +1,31 @@
----
-license: other
-license_name: nexusflow-research-license
-license_link: LICENSE
----

+---
+license: other
+---
+# What is this?
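+This is a quantization of [Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat), the next-generation successor to Nexusflow's Athene-70B, produced with a Japanese imatrix.<br>
+Starting with this release, the base model has switched to Qwen2.5-72B-Instruct, and two variants are now provided: a Chat model and an Agent model.<br>
+Please note that **commercial use is not permitted**.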
+# imatrix dataset
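+To prioritize Japanese capability, the imatrix was computed with the [TFMC/imatrix-dataset-for-japanese-llm](https://huggingface.co/datasets/TFMC/imatrix-dataset-for-japanese-llm) dataset, which contains a large amount of Japanese text.<br>
+Because of limited compute resources, the Q8_0-quantized model was used to compute the imatrix.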
+# Chat template
+```
+<|im_start|>system
+Write your system prompt here.<|im_end|>
+<|im_start|>user
+Write your message here.<|im_end|>
+<|im_start|>assistant
+```
+# Environment
+Quantization was performed with the Windows build of llama.cpp-b3621 and the convert-hf-to-gguf.py script released alongside llama.cpp-b3472.
+# License
+Qwen LICENSE & Nexusflow Research License
+# Developer
+Alibaba Cloud & Nexusflow
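
The chat template in the README can be assembled programmatically when calling the model through a raw-completion interface. The following is a minimal sketch, not part of the commit: the function name `build_prompt` is hypothetical, and the trailing newline after `assistant` is an assumption based on the layout of the template block.

```python
# Hypothetical helper that renders the README's chat template for one turn.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_prompt(user_message: str, system_prompt: str) -> str:
    """Return a single-turn prompt in the format shown under "Chat template"."""
    return (
        f"{IM_START}system\n{system_prompt}{IM_END}\n"
        f"{IM_START}user\n{user_message}{IM_END}\n"
        # Assumption: generation starts on the line after the assistant tag.
        f"{IM_START}assistant\n"
    )


if __name__ == "__main__":
    print(build_prompt("Hello!", "You are a helpful assistant."))
```

The resulting string would be passed as the prompt, with `<|im_end|>` treated as the stop sequence. Frontends that already apply the template embedded in the GGUF file do not need this step.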