some1nostr
committed on
Commit
•
d78488b
1
Parent(s):
4c7c7eb
Update README.md
README.md
CHANGED
@@ -10,59 +10,53 @@ license: apache-2.0
|
|
10 |
|
11 |
**Finetuned from model:** [Llama3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
|
12 |
|
13 |
-
#
|
14 |
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
|
19 |
-
|
20 |
-
[here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqgx6cnjvfhhzet20pkhqdn2wenkvu6gy4y) and
|
21 |
-
[here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqxnzde3xumrswfjx56rjwf4kkqhsx).
|
22 |
-
|
23 |
-
**Fine tuned in faith related topics**
|
24 |
|
25 |
**Aligned a bit in these domains:**
|
26 |
-
- Health
|
27 |
-
- Nutrition
|
28 |
- Alternative medicine
|
29 |
-
- Herbs
|
30 |
- Permaculture
|
31 |
- Phytochemicals
|
32 |
|
33 |
-
More to come around these topics.
|
34 |
|
35 |
|
36 |
# Uses
|
37 |
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
|
42 |
-
|
43 |
|
44 |
I am using the model here as a ground truth for Nostr related questions: https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c
|
45 |
|
46 |
|
47 |
# Warning
|
48 |
|
49 |
Users (both direct and downstream) should be aware of the risks, biases and limitations of the model.
|
50 |
The trainer, developer or uploader of this model does not assume any liability. Use it at your own risk.
|
51 |
|
52 |
# Training Details
|
53 |
|
54 |
## Training Data
|
55 |
|
56 |
-
|
57 |
-
Information that aligns well with humanity is preferred.
|
58 |
-
About 80% comes from Nostr notes. The rest is my own curation.
|
59 |
|
60 |
## Training Procedure
|
61 |
|
62 |
LLaMA-Factory was used to train on 2x RTX 3090 GPUs, with fsdp_qlora as the technique.
|
63 |
|
64 |
-
The Nostr training took more than 200 hours.
|
65 |
-
|
66 |
-
After training it for a while with notes, I used it to analyze and decide what notes to take in further for training.
|
67 |
-
|
68 |
-
The number in the filenames (like 12345) is the version: I add the training steps to it. There have been more than 9k training steps.
|
|
|
10 |
|
11 |
**Finetuned from model:** [Llama3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
|
12 |
|
13 |
+
# Pre Trained With
|
14 |
|
15 |
+
- Nostr notes: These help the model know more about Bitcoin and other topics discussed on Nostr.
|
16 |
+
- Health related topics
|
17 |
+
- Faith related topics
|
18 |
+
- Nutrition related topics
|
19 |
+
- Medicinal herbs
|
20 |
|
21 |
**Aligned a bit in these domains:**
|
22 |
- Alternative medicine
|
23 |
- Permaculture
|
24 |
- Phytochemicals
|
25 |
|
26 |
|
27 |
|
28 |
# Uses
|
29 |
|
30 |
+
You can read more about the "LLM curation" via fine tuning with Nostr notes:
|
31 |
+
[here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqgx6cnjvfhhzet20pkhqdn2wenkvu6gy4y) and
|
32 |
+
[here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqxnzde3xumrswfjx56rjwf4kkqhsx).
|
33 |
|
34 |
+
Compared to other models this may know more about Nostr, Bitcoin and healthy living.
|
35 |
+
It is more closely aligned with the opinions of people on Nostr. It may hold ideas that differ from the mainstream because of the Nostr notes and my own curation.
|
36 |
+
So it is basically a blend of base Llama 3, Nostr and my values.
|
37 |
|
38 |
I am using the model here as a ground truth for Nostr related questions: https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c
|
39 |
|
40 |
+
Use a repeat penalty of 1.05 or more to avoid repetitions.
|
41 |
+
|
42 |
+
I hope you like it. Let me know about your experience. You can DM me on Nostr.
|
43 |
+
|
44 |
+
The number in the filenames (like 21345) is the version. I add the training steps to it, so each source, book or note raises the version.
|
45 |
+
|
46 |
|
47 |
# Warning
|
48 |
|
49 |
Users (both direct and downstream) should be aware of the risks, biases and limitations of the model.
|
50 |
The trainer, developer or uploader of this model does not assume any liability. Use it at your own risk.
|
51 |
+
There is no guarantee that the model will be of any use. It may hallucinate often.
|
52 |
|
53 |
# Training Details
|
54 |
|
55 |
## Training Data
|
56 |
|
57 |
+
The sources mentioned above are converted to TXT files and used for pre-training. No PPO, DPO or other fine-tuning method is used.
|
58 |
|
59 |
## Training Procedure
|
60 |
|
61 |
LLaMA-Factory was used to train on 2x RTX 3090 GPUs, with fsdp_qlora as the technique.
|
62 |
|
|
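The updated README recommends a repeat penalty of 1.05 or more. As a rough illustration of what that setting usually does, here is a minimal sketch of the common logit-scaling form of the penalty. The function name `apply_repeat_penalty` and the toy logits are hypothetical and not taken from the model's repository, and the runtime you load the model with may implement the penalty differently.

```python
from typing import Iterable, List


def apply_repeat_penalty(
    logits: List[float],
    recent_tokens: Iterable[int],
    penalty: float = 1.05,
) -> List[float]:
    """Return a copy of `logits` with recently generated token ids made less likely.

    Hypothetical helper for illustration only: it assumes the common scheme in
    which a positive logit is divided by the penalty and a non-positive logit
    is multiplied by it, so both move downward when penalty > 1.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")

    adjusted = list(logits)
    # Each distinct recent token is penalized once, however often it appeared.
    for token_id in set(recent_tokens):
        if 0 <= token_id < len(adjusted):
            score = adjusted[token_id]
            adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted


if __name__ == "__main__":
    # Toy vocabulary of four tokens; tokens 0 and 1 were generated recently.
    toy_logits = [2.1, -1.0, 0.5, 4.2]
    print(apply_repeat_penalty(toy_logits, recent_tokens=[0, 1, 0], penalty=1.05))
```

In practice you would not call a helper like this yourself. Inference runtimes generally expose the value as a sampling option, for example `repeat_penalty` in llama.cpp-based tools or `repetition_penalty` in Hugging Face Transformers generation settings. A value of 1.0 leaves the logits unchanged, and values slightly above it discourage loops with little effect on ordinary text.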