RioShiina commited on
Commit
9495618
·
verified ·
1 Parent(s): 2822277

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -3
README.md CHANGED
@@ -1,3 +1,70 @@
1
- ---
2
- license: llama3.1
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3.1
3
+ language:
4
+ - ja
5
+ - en
6
+ ---
7
+
8
+ **[4.0bpw](https://huggingface.co/rioshiina/Llama-3.1-Swallow-8B-Instruct-v0.2-exl2/tree/4.0bpw)**
9
+ **[6.0bpw](https://huggingface.co/rioshiina/Llama-3.1-Swallow-8B-Instruct-v0.2-exl2/tree/6.0bpw)**
10
+ **[8.0bpw](https://huggingface.co/rioshiina/Llama-3.1-Swallow-8B-Instruct-v0.2-exl2/tree/8.0bpw)**
11
+
12
+ # Llama-3.1-Swallow-8B-Instruct-v0.2-exl2
13
+ - Model creator: [tokyotech-llm](https://huggingface.co/tokyotech-llm)
14
+ - Original model: [Llama-3.1-Swallow-8B-Instruct-v0.2](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2)
15
+
16
+ ### License
17
+
18
+ [META LLAMA 3.1 COMMUNITY LICENSE](https://www.llama.com/llama3_1/license/)
19
+
20
+ ## Prompt template
21
+
22
+ ```
23
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
24
+
25
+ あなたは誠実で優秀な日本人のアシスタントです。<|eot_id|><|start_header_id|>user<|end_header_id|>
26
+
27
+ 東京の紅葉した公園で、東京タワーと高層ビルを背景に、空を舞うツバメと草地に佇むラマが出会う温かな物語を書いてください。<|eot_id|><|start_header_id|>assistant<|end_header_id|>
28
+
29
+ ```
30
+
31
+ ### Citations
32
+
33
+ ```tex
34
+ @inproceedings{Fujii:COLM2024,
35
+ title={Continual Pre-Training for Cross-Lingual LLM Adaptation:
36
+ Enhancing Japanese Language Capabilities},
37
+ author={Kazuki Fujii and Taishi Nakamura and Mengsay Loem and Hiroki
38
+ Iida and Masanari Ohi and Kakeru Hattori and Hirai Shota and Sakae
39
+ Mizuki and Rio Yokota and Naoaki Okazaki},
40
+ booktitle="Proceedings of the First Conference on Language Modeling",
41
+ series={COLM},
42
+ pages="(to appear)",
43
+ year="2024",
44
+ month=oct,
45
+ address={University of Pennsylvania, USA},
46
+ }
47
+
48
+ @inproceedings{Okazaki:COLM2024,
49
+ title={Building a Large Japanese Web Corpus for Large Language Models},
50
+ author={Naoaki Okazaki and Kakeru Hattori and Hirai Shota and Hiroki
51
+ Iida and Masanari Ohi and Kazuki Fujii and Taishi Nakamura and Mengsay
52
+ Loem and Rio Yokota and Sakae Mizuki},
53
+ booktitle="Proceedings of the First Conference on Language Modeling",
54
+ series={COLM},
55
+ pages="(to appear)",
56
+ year="2024",
57
+ month=oct,
58
+ address={University of Pennsylvania, USA},
59
+ }
60
+
61
+ @misc{dubey2024llama3herdmodels,
62
+ title={The Llama 3 Herd of Models},
63
+ author={Abhimanyu Dubey and Abhinav Jauhri and Abhinav Pandey and Abhishek Kadian and Ahmad Al-Dahle and Aiesha Letman and Akhil Mathur and Alan Schelten and Amy Yang and Angela Fan et al.},
64
+ year={2024},
65
+ eprint={2407.21783},
66
+ archivePrefix={arXiv},
67
+ primaryClass={cs.AI},
68
+ url={https://arxiv.org/abs/2407.21783},
69
+ }
70
+ ```