DuckyBlender commited on
Commit
1fe49dc
1 Parent(s): 50bec4c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: gpl-3.0
3
+ datasets:
4
+ - CohereForAI/aya_dataset
5
+ language:
6
+ - pl
7
+ tags:
8
+ - lobotomy
9
+ ---
10
+
11
+ **Polish-Lobotomy: An awful polish fine-tune**
12
+ ============================================================
13
+
14
+
15
+ **Model Description**
16
+ ---------------
17
+
18
+ This fine-tuned Phi-3 model is the first attempt at a Polish fine-tune of Phi-3. It is very bad, probably because of the fine-tuning method (making the model learn a new language probably needs a full fine-tune) and the small dataset.
19
+ - Ollama: [https://ollama.com/duckyblender/polish-lobotomy](https://ollama.com/duckyblender/polish-lobotomy)
20
+
21
+ **Training Details**
22
+ -----------------
23
+
24
+ * Trained on a single RTX 4060 for approximately 1 hour
25
+ * Utilized 8-bit QLORA for efficient training
26
+ * Despite the short training period, the model somehow managed to learn something (but not very well)
27
+
28
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6317acd6212fce5a3cd793f6/KnxTL_Ww3aYUrJz8kZ5Sz.jpeg)
29
+
30
+ **Dataset**
31
+ ------------
32
+
33
+ The model was trained on the Polish subset of the AYA dataset, which can be found at [https://huggingface.co/datasets/CohereForAI/aya_dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset).
34
+
35
+ **Prompt Template**
36
+ -----------------
37
+
38
+ The prompt template used for this model is identical to the Phi 3 template.
39
+
40
+ **Disclaimer**
41
+ --------------
42
+
43
+ **Please be advised that this model's output may contain nonsensical responses. Viewer discretion is strongly advised (but not really necessary).**
44
+
45
+ Use this model at your own risk, and please engage with the output responsibly (but let's be real, it's not like it's going to be useful for anything).