xaviviro commited on
Commit
4d56008
1 Parent(s): b9d3bac

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +79 -0
README.md ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: openlm-research/open_llama_3b_v2
4
+ datasets:
5
+ - xaviviro/oasst2_euskera_gpt
6
+ tags:
7
+ - finetune
8
+ - chatml
9
+ - gpt4
10
+ - catalan
11
+ model-index:
12
+ - name: SUGARRA-3B
13
+ results: []
14
+ library_name: transformers
15
+ widget:
16
+ - text: "<|im_start|>user\nNor zen Isaac Newton?<|im_end|>\n<|im_start|>assistant\n"
17
+ language:
18
+ - eu
19
+ - en
20
+ ---
21
+
22
+ # SUGARRA: Modelo 3B experimental ChatML euskaldun.
23
+
24
+
25
+ ![FLAMA](flama.png)
26
+
27
+ SUGARRA es el resultado de finetunear el modelo [open_llama_3b_v2](/openlm-research/open_llama_3b_v2) con las instrucciones [OpenAssistant v2](/datasets/OpenAssistant/oasst2) traducidas automáticamente al euskera usando recursos de [Helsinki-NLP](/Helsinki-NLP) y tratadas en formato ChatML.
28
+
29
+
30
+ # Prompt Template
31
+
32
+ SUGARRA usa el prompt template **ChatML**:
33
+
34
+ ```
35
+ <|im_start|>user
36
+ Nor zen Isaac Newton?<|im_end|>
37
+ <|im_start|>assistant\n
38
+ ```
39
+
40
+ [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
41
+
42
+ ## Referències
43
+
44
+ ```
45
+ @software{xaviviro2024sugarra,
46
+ author = {xaviviro},
47
+ title = {SUGARRA: Modelo 3B experimental ChatML euskaldun.},
48
+ month = January,
49
+ year = 2024,
50
+ url = {https://huggingface.co/xaviviro/SUGARRA-3B}
51
+ }
52
+ ```
53
+
54
+ ```
55
+ @software{openlm2023openllama,
56
+ author = {Geng, Xinyang and Liu, Hao},
57
+ title = {OpenLLaMA: An Open Reproduction of LLaMA},
58
+ month = May,
59
+ year = 2023,
60
+ url = {https://github.com/openlm-research/open_llama}
61
+ }
62
+ ```
63
+ ```
64
+ @software{together2023redpajama,
65
+ author = {Together Computer},
66
+ title = {RedPajama-Data: An Open Source Recipe to Reproduce LLaMA training dataset},
67
+ month = April,
68
+ year = 2023,
69
+ url = {https://github.com/togethercomputer/RedPajama-Data}
70
+ }
71
+ ```
72
+ ```
73
+ @article{touvron2023llama,
74
+ title={Llama: Open and efficient foundation language models},
75
+ author={Touvron, Hugo and Lavril, Thibaut and Izacard, Gautier and Martinet, Xavier and Lachaux, Marie-Anne and Lacroix, Timoth{\'e}e and Rozi{\`e}re, Baptiste and Goyal, Naman and Hambro, Eric and Azhar, Faisal and others},
76
+ journal={arXiv preprint arXiv:2302.13971},
77
+ year={2023}
78
+ }
79
+ ```