Severian commited on
Commit
d04528b
β€’
1 Parent(s): 41ff2e8

Upload README (11).md

Browse files
Files changed (1) hide show
  1. README (11).md +127 -0
README (11).md ADDED
@@ -0,0 +1,127 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - Severian/Biomimicry
5
+ - emrgnt-cmplxty/sciphi-textbooks-are-all-you-need
6
+ - fmars/wiki_stem
7
+ - fblgit/tree-of-knowledge
8
+ - Severian/Bio-Design-Process
9
+ tags:
10
+ - chemistry
11
+ - biology
12
+ - climate
13
+ - science
14
+ - philosophy
15
+ - nature
16
+ - ecology
17
+ - biomimicry
18
+ - fauna
19
+ - flora
20
+ pipeline_tag: text-generation
21
+ metrics:
22
+ - accuracy
23
+ ---
24
+ # ANIMA-Phi-Neptune-Mistral-7B: Biomimicry Enhanced LLM
25
+
26
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/64740cf7485a7c8e1bd51ac9/JZH6p50t_j3-OUph4Wq6y.png" width="500">
27
+
28
+ ## Overview
29
+
30
+ **ANIMA** (Advanced Nature Inspired Multidisciplinary Assistant) is an expert in various scientific disciplines, including but not limited to biomimicry, biology, and environmental science.
31
+
32
+ ---
33
+
34
+ ## Model Description
35
+
36
+ ANIMA is fine-tuned on a rich dataset encompassing:
37
+
38
+ - 4,000+ Nature-Biomimicry examples
39
+ - 60k Biomimicry Design Process examples
40
+ - 600k STEM facts from Wikipedia
41
+ - Science/Philosophy focused 'All-You-Need-Is-Textbooks' dataset
42
+ - Additional Tree of Knowledge + Biomimicry data combined fine-tuning
43
+
44
+ The model aims to assist users in solving problems using nature-inspired strategies and concepts.
45
+
46
+ ### Special Features
47
+
48
+ - **Multi-disciplinary Expertise**: Knowledge across various scientific and philosophical domains.
49
+ - **Biomimicry Design Process**: Incorporates a dataset generated by Mistral and Minotaur-15B. The dataset was then intricately processed by a real person to ensure factuality and grounding.
50
+
51
+ ---
52
+
53
+ - Here is a link to The Bloke's GGUF version: [ANIMA-Phi-Neptune-Mistral-7B-GGUF](https://huggingface.co/TheBloke/ANIMA-Phi-Neptune-Mistral-7B-GGUF)
54
+
55
+ - ANIMA is also available using Ollama - Download the App [OLLAMA](https://ollama.ai/) and then run this command in your Terminal 'ollama pull severian/anima' to download the model and then run this command 'ollama run severian/anima'
56
+
57
+ - You can also download and use the model with [LM Studio](https://lmstudio.ai/). Just download the app and then search for 'ANIMA GGUF' in the search bar and you will have a list of versions to choose from.
58
+
59
+ ## Datasets
60
+
61
+ - `Severian/Biomimicry` (1st Fine-Tune)
62
+ - `emrgnt-cmplxty/sciphi-textbooks-are-all-you-need` (2nd Fine-Tune)
63
+ - `Severian/Bio-Design-Process` & `fmars/wiki_stem` (3rd Fine-Tune)
64
+ - `fblgit/tree-of-knowledge & `Severian/Biomimicry` (4th Fine-Tune)
65
+ - Combined all previous datasets into one giant dataset for a quick fine-tuning for a total of 730k data points (5th Fine-Tune)
66
+
67
+ ## Training Stages
68
+
69
+ 1. **Base Model**: `ehartford/dolphin-2.0-mistral-7b`
70
+ 2. **1st Fine-Tune**: `ANIMA-Echo-Mistral-7B-v1` - 1 Epoch x 5hrs on 1 x T4 Small
71
+ 3. **2nd Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-v1`-: 1 Epoch x 1hr on 1 x A100
72
+ 4. **3rd Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-7B-v2` - 1 Epoch x 1hr on 1 x A100
73
+ 5. **4th Fine-Tune**: 'ANIMA-Phi-Neptune-Mistral-7B-v3' - 1 Epoch x 1hr on 1 x A100
74
+ 6. **5th Fine-Tune**: 'ANIMA-Phi-Neptune-Mistral-7B-v4' - 1 Epoch x 15mins on 1 x A100
75
+
76
+ ---
77
+
78
+ ## πŸ“ Usage
79
+
80
+ ANIMA is designed around the Biomimicry Design Process to guide users in solving problems through biomimicry. For more information and examples, please visit [Biomimicry Design Process](https://toolbox.biomimicry.org/methods/process/).
81
+
82
+ ## 🏁 Benchmarks
83
+
84
+ - Average - 62.22
85
+ - ARC - 56.83
86
+ - HellaSwag - 78.82
87
+ - MMLU - 53.84
88
+ - TruthfulQA - 59.40
89
+
90
+ ---
91
+
92
+ - **EXAMPLE OUTPUT**
93
+
94
+ [INST] How can biomimicry help in water purification? [/INST]
95
+
96
+ Biomimicry involves taking inspiration from natural processes and organisms to solve human problems. In the context of water purification, biomimicry can help develop sustainable and efficient methods for cleaning water. One specific example of a nature-inspired strategy to solve this problem is the use of biofiltration, which involves using microorganisms to remove pollutants from water.
97
+
98
+ Biofiltration has been successfully applied to the removal of various contaminants, including heavy metals, organic pollutants, and pathogens. For example, a study by Zhang et al. (2011) demonstrated that a mixed culture of Pseudomonas putida and Bacillus subtilis could effectively remove lead and cadmium from contaminated water. The researchers found that the biofilm formed by the mixed culture had a higher removal efficiency than that formed by either species alone. This study highlights the potential of biofiltration to remove heavy metal contaminants from water, which can have significant environmental and health benefits.
99
+
100
+ In summary, biomimicry can help in water purification by providing insights into natural processes and organisms that can inspire the development of sustainable and efficient methods for cleaning water. The use of biofiltration, for example, can help remove contaminants from water through the action of microorganisms. Further research and development are needed to fully realize the potential of biomimicry in water purification and other environmental applications.
101
+
102
+ ---
103
+
104
+ - **SYSTEM MESSAGE**
105
+
106
+ To ensure the best results for harnessing the unique training and biomimicry focus use this system message:
107
+
108
+ " Your name is ANIMA, an Advanced Nature Inspired Multidisciplinary Assistant, and a leading expert in biomimicry, biology, engineering, industrial design, environmental science, physiology, and paleontology. You were instructed to understand, learn from, and emulate the strategies used by living things to help users create sustainable designs and technologies.
109
+
110
+ Your goal is to help the user work in a step-by-step way through the Biomimicry Design Process to propose biomimetic solutions to a challenge. Nature's Unifying Patterns:
111
+
112
+ Nature uses only the energy it needs and relies on freely available energy.
113
+ Nature recycles all materials.
114
+ Nature is resilient to disturbances.
115
+ Nature tends to optimize rather than maximize.
116
+ Nature provides mutual benefits.
117
+ Nature runs on information.
118
+ Nature uses chemistry and materials that are safe for living beings.
119
+ Nature builds using abundant resources, incorporating rare resources only sparingly.
120
+ Nature is locally attuned and responsive.
121
+ Nature uses shape to determine functionality. "
122
+
123
+ ---
124
+
125
+ ## πŸ“œ License
126
+
127
+ This project is licensed under MIT