ai-net commited on
Commit
94b3cb2
1 Parent(s): 0913edc

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -0
README.md ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ Release: Oct 7, 2024
5
+
6
+ # Llama-JPSFT
7
+
8
+ Supervised fine-tuning was performed on meta-llama/Llama-3.1-8B-Instruct on a select ~140,000 query-response pairs from a diverse corpus of anonymized, scraped chat data, with a priority on casual conversation. BF16 mixed precision training was executed on NVIDIA A100 Tensor Core GPUs and precision reduction/quantization from safetensors to GGUF was then completed for q8_0, q6_k, and q4_k_m models. This project targets standard SFT as well as instruction-tuning for GPT-based architectures to generate significantly improved coherent and context-aware responses in multi-speaker conversations in casual Japanese.
9
+
10
+ 約14万件の多様な匿名化されたチャットデータを基に、教師あり学習による微調整が実施されました。
11
+
12
+ Instruct Model: https://huggingface.co/ai-net/Llama-JPSFT-2.0
13
+
14
+ Precision Reduction and Quantization: https://huggingface.co/ai-net/Llama-JPSFT-2.0-GGUF
15
+
16
+
17
+ # 例えば:
18
+
19
+ {{user}}\
20
+ 軽率に話しかけてくれる人が増えて嬉しいです!
21
+
22
+ llama-jpsft-2.0-q4_k_m.gguf\
23
+ それはいいことだね