some1nostr committed on
Commit
d78488b
1 Parent(s): 4c7c7eb

Update README.md

Files changed (1)
  1. README.md +20 -26
README.md CHANGED
@@ -10,59 +10,53 @@ license: apache-2.0
 
  **Finetuned from model:** [Llama3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
 
- # Fine Tuned With
 
- **Nostr notes**:
-
- This fine tuning makes the model know more about bitcoin and topics on Nostr.
-
- You can read more about the "LLM curation" via fine tuning with Nostr notes:
- [here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqgx6cnjvfhhzet20pkhqdn2wenkvu6gy4y) and
- [here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqxnzde3xumrswfjx56rjwf4kkqhsx).
-
- **Fine tuned in faith related topics**
 
  **Aligned a bit in these domains:**
- - Health
- - Nutrition
  - Alternative medicine
- - Herbs
  - Permaculture
  - Phytochemicals
 
- More to come around these topics.
 
 
  # Uses
 
- Compared to other models this may know more about Nostr and Bitcoin. It is aligned with opinions of people on Nostr. It may have alternative ideas to mainstream because Nostr is very censorship resistant.
-
- You can use llama.cpp to use the GGUF file.
 
- Use repeat penalty of 1.05 or more to avoid repetitions.
 
  I am using the model here as a ground truth for Nostr-related questions: https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c
 
 
  # Warning
 
  Users (both direct and downstream) should be aware of the risks, biases and limitations of the model.
  The trainer, developer or uploader of this model does not assume any liability. Use it at your own risk.
 
  # Training Details
 
  ## Training Data
 
- Nostr related info from web and nostr itself, bitcoin related info.
- Information that aligns well with humanity is preferred.
- About 80% comes from Nostr notes. The rest is my own curation.
 
  ## Training Procedure
 
  LLaMA-Factory is used to train on 2x RTX 3090 GPUs, with fsdp_qlora as the technique.
 
- The Nostr training took more than 200 hours.
-
- After training it for a while with notes, I used it to analyze and decide what notes to take in further for training.
-
- The number in the filenames (like 12345) means the version. I take the training steps and add those to the version. There has been more than 9k training steps.
 
 
  **Finetuned from model:** [Llama3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
 
+ # Pre-Trained With
 
+ - Nostr notes: This makes the model know more about Bitcoin and other topics discussed on Nostr.
+ - Health-related topics
+ - Faith-related topics
+ - Nutrition-related topics
+ - Medicinal herbs
 
  **Aligned a bit in these domains:**
  - Alternative medicine
  - Permaculture
  - Phytochemicals
 
 
 
  # Uses
 
+ You can read more about "LLM curation" via fine-tuning with Nostr notes:
+ [here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqgx6cnjvfhhzet20pkhqdn2wenkvu6gy4y) and
+ [here](https://habla.news/a/naddr1qvzqqqr4gupzp8lvwt2hnw42wu40nec7vw949ys4wgdvums0svs8yhktl8mhlpd3qqxnzde3xumrswfjx56rjwf4kkqhsx).
 
+ Compared to other models, this one may know more about Nostr, Bitcoin and healthy living.
+ It is more closely aligned with the opinions of people on Nostr. It may hold ideas that differ from the mainstream because of Nostr and my own curation.
+ So it sits somewhere between base Llama 3, Nostr and my values.
 
  I am using the model here as a ground truth for Nostr-related questions: https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c
 
+ Use a repeat penalty of 1.05 or more to avoid repetitions.
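
The repeat-penalty advice can be sketched as follows, assuming the GGUF file is run through llama.cpp's command-line tool. The helper only builds the argument list and rejects values below 1.05. The binary name `llama-cli`, the file name `model.gguf` and the helper `build_llama_cpp_command` are illustrative assumptions, not part of the README.

```python
import shlex

# Lower bound suggested in the README text above.
MIN_REPEAT_PENALTY = 1.05


def build_llama_cpp_command(model_path, prompt, repeat_penalty=1.05,
                            binary="llama-cli"):
    """Build an argument list for llama.cpp's CLI (hypothetical helper).

    `binary` is an assumption: the executable name depends on the
    llama.cpp build you have installed.
    """
    if repeat_penalty < MIN_REPEAT_PENALTY:
        raise ValueError(
            f"repeat_penalty should be at least {MIN_REPEAT_PENALTY}"
        )
    return [
        binary,
        "-m", model_path,                         # path to the GGUF file
        "-p", prompt,                             # prompt text
        "--repeat-penalty", str(repeat_penalty),  # discourage repetition
    ]


if __name__ == "__main__":
    cmd = build_llama_cpp_command("model.gguf", "What is Nostr?")
    # Print a shell-safe command line; pass `cmd` to subprocess.run to execute.
    print(shlex.join(cmd))
```

Keeping the check in one place makes it harder to launch the model with a penalty below the suggested minimum by accident.
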
+
+ I hope you like it. Let me know about your experience. You can DM me on Nostr.
+
+ The number in the filenames (like 21345) is the version. I add the training steps to the version number, so each source, book or note increases it.
+
 
  # Warning
 
  Users (both direct and downstream) should be aware of the risks, biases and limitations of the model.
  The trainer, developer or uploader of this model does not assume any liability. Use it at your own risk.
+ There is no guarantee that the model will be of any use. It may hallucinate often.
 
  # Training Details
 
  ## Training Data
 
+ The sources mentioned above are converted to TXT files and used for pre-training. No PPO, DPO or other fine-tuning method is used.
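
A minimal sketch of the TXT-to-pre-training step described above, assuming one plain-text record per source file written as JSON Lines. The field name `text`, the directory layout and both helper names are assumptions for illustration; the README does not state the exact dataset format that was used.

```python
import json
from pathlib import Path


def txt_files_to_pretraining_records(txt_dir):
    """Read every .txt file in txt_dir into one {"text": ...} record each."""
    records = []
    for path in sorted(Path(txt_dir).glob("*.txt")):
        text = path.read_text(encoding="utf-8").strip()
        if text:  # skip files that are empty or whitespace only
            records.append({"text": text})
    return records


def write_jsonl(records, out_path):
    """Write one JSON object per line, keeping non-ASCII text readable."""
    with open(out_path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")


if __name__ == "__main__":
    # "sources" and "pretrain.jsonl" are placeholder paths.
    write_jsonl(txt_files_to_pretraining_records("sources"), "pretrain.jsonl")
```

Because the records carry raw text only, with no prompt and response pairs, this matches the plain pre-training setup the README describes rather than an instruction-tuning dataset.
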
 
 
 
  ## Training Procedure
 
  LLaMA-Factory is used to train on 2x RTX 3090 GPUs, with fsdp_qlora as the technique.