File size: 11,847 Bytes
d04528b
ef2bfbf
d04528b
 
 
 
 
 
 
 
 
 
 
54769e3
 
 
 
 
 
d04528b
 
54769e3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d04528b
 
 
 
 
 
 
 
 
0943653
4b33090
d04528b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
af667ea
d04528b
 
 
 
 
ea2d59b
 
 
d04528b
912fb40
d04528b
ea2d59b
0ed7d69
 
00ee56f
26f14db
06d693c
d2600fd
8a993ef
e1b2cfb
 
2322b8c
d04528b
 
 
 
 
1aad4ad
8b9ea5b
d04528b
 
 
 
 
 
 
ef2bfbf
 
d04528b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8b9ea5b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d04528b
 
 
af667ea
 
 
 
8b9ea5b
 
 
 
d04528b
 
00f69ce
 
54769e3
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
---
license: artistic-2.0
tags:
- chemistry
- biology
- climate
- science
- philosophy
- nature
- ecology
- biomimicry
- fauna
- flora
datasets:
- Severian/Biomimicry
- emrgnt-cmplxty/sciphi-textbooks-are-all-you-need
- fmars/wiki_stem
- fblgit/tree-of-knowledge
- Severian/Bio-Design-Process
metrics:
- accuracy
pipeline_tag: text-generation
model-index:
- name: ANIMA-Phi-Neptune-Mistral-7B-v4
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 55.46
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 77.63
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 53.12
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 59.01
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 73.48
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 14.94
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Severian/ANIMA-Phi-Neptune-Mistral-7B-v4
      name: Open LLM Leaderboard
---
# ANIMA-Phi-Neptune-Mistral-7B: Biomimicry Enhanced LLM

<img src="https://cdn-uploads.huggingface.co/production/uploads/64740cf7485a7c8e1bd51ac9/JZH6p50t_j3-OUph4Wq6y.png" width="500">

## Overview

**ANIMA** (Advanced Nature Inspired Multidisciplinary Assistant) is an expert in various scientific disciplines, including but not limited to biomimicry, biology, and environmental science.

**Instagram: [@anima_llm](https://www.instagram.com/anima_llm)**

---

##  Model Description

ANIMA is fine-tuned on a rich dataset encompassing:

- 4,000+ Nature-Biomimicry examples
- 60k Biomimicry Design Process examples
- 600k STEM facts from Wikipedia
- Science/Philosophy focused 'All-You-Need-Is-Textbooks' dataset
- Additional Tree of Knowledge + Biomimicry data combined fine-tuning

The model aims to assist users in solving problems using nature-inspired strategies and concepts.

### Special Features

- **Multi-disciplinary Expertise**: Knowledge across various scientific and philosophical domains.
- **Biomimicry Design Process**: Incorporates a dataset generated by Mistral and Minotaur-15B. The dataset was then intricately processed by a real person to ensure factuality and grounding. 

---

- Here is a link to The Bloke's GGUF version: [ANIMA-Phi-Neptune-Mistral-7B-GGUF](https://huggingface.co/TheBloke/ANIMA-Phi-Neptune-Mistral-7B-GGUF)


- ANIMA is also available using Ollama - Download the [OLLAMA](https://ollama.ai/) App (MacOS & Linux) and then run this command in your Terminal '**ollama pull severian/anima**' to download the model and then run this command '**ollama run severian/anima**' to load the model and start talking.


- You can also download and use the model with [LM Studio](https://lmstudio.ai/) (All OS systems). Just download the app and then search for 'ANIMA GGUF' in the search bar and you will have a list of versions to choose from.


- Want to test ANIMA + Ollama and chat right away? Download the model from Ollama and head here to chat with ANIMA right in your browser! [ANIMA - Chat](https://severian42.github.io/ANIMA-Chat/)

- Have a PDF you want to discuss with ANIMA + Ollama? Head here and you can do just that in your browser, 100% locally. [ANIMA - Locally Chat over your PDF](https://ANIMA-pdf-chat.vercel.app/)

- ANIMA is also being hosted on a Hugging Face Space if you'd like to try it there (It may be slow to generate a respone but it still works just fine) [ANIMA - HF Space](https://huggingface.co/spaces/Severian/ANIMA-7B-Biomimicry-LLM)

- **Contact**
  
If you want to discuss the model/dataset/concept further, have any cool ideas, want to collaborate or if you run into any issues with the model, please don't hesitate to reach out at InterwovenArkitech@Gmail.com 

## Datasets

- `Severian/Biomimicry` (1st Fine-Tune)
- `emrgnt-cmplxty/sciphi-textbooks-are-all-you-need` (2nd Fine-Tune)
- `Severian/Bio-Design-Process` & `fmars/wiki_stem` (3rd Fine-Tune)
- `fblgit/tree-of-knowledge` & `Severian/Biomimicry` (4th Fine-Tune)
- `Combined all previous datasets` into one giant dataset for a quick fine-tuning for a total of 730k data points (5th Fine-Tune)

## Training Stages

1. **Base Model**: `ehartford/dolphin-2.0-mistral-7b`
2. **1st Fine-Tune**: `ANIMA-Echo-Mistral-7B-v1` - 1 Epoch x 5hrs on 1 x T4 Small 
3. **2nd Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-v1`-: 1 Epoch x 1hr on 1 x A100 
4. **3rd Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-7B-v2` - 1 Epoch x 1hr on 1 x A100 
5. **4th Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-7B-v3` - 1 Epoch x 1hr on 1 x A100
6. **5th Fine-Tune**: `ANIMA-Phi-Neptune-Mistral-7B-v4` - 1 Epoch x 15mins on 1 x A100 

---

## ๐Ÿ“ Usage

ANIMA is designed around the Biomimicry Design Process to guide users in solving problems through biomimicry. For more information and examples, please visit [Biomimicry Design Process](https://toolbox.biomimicry.org/methods/process/).

## ๐Ÿ Benchmarks

- Average - 62.22
- ARC - 56.83
- HellaSwag - 78.82
- MMLU - 53.84
- TruthfulQA - 59.40

---

- **EXAMPLE OUTPUT**

  [INST] How can biomimicry help in water purification? [/INST]

Biomimicry plays a significant role in water purification by taking 
inspiration from natural processes and organisms to develop sustainable 
and efficient solutions. Here's a comprehensive overview of how biomimicry
can help in water purification:

1. Microbial filtration:
   - Bacteriophages (viruses that infect bacteria) can effectively target 
contaminants in water systems, such as Escherichia coli and Staphylococcus
aureus. By mimicking the natural defense mechanisms of bacteriophages, we 
can develop efficient filtration systems that remove pollutants from water
sources.
   - Some fungi, such as Trametes versicolor, can break down organic 
pollutants in water using enzymes. This process helps make contaminated 
water safe for consumption. By studying and replicating these natural 
processes, we can develop sustainable filtration systems that remove 
pollutants from water sources effectively.

2. Biofilm-based filtration:
   - Microorganisms form biofilms on surfaces to protect themselves from 
environmental threats like predators or contaminants. By mimicking the 
natural formation of biofilms, we can develop sustainable filtration 
systems that use bacteria or fungi to remove pollutants from water 
sources.
   - For example, a study has shown how Pseudomonas putida could 
be used to biodegrade petroleum hydrocarbons in contaminated water sources
using a biofilm-based system. This method offers an eco-friendly 
alternative to conventional chemical treatments for oil spills.

3. Wetland-based filtration:
   - Wetlands act as natural filters for water by retaining sediments and 
organic matter, which helps remove contaminants. By mimicking the natural 
processes of a wetland, we can design effective water purification systems
that use sand filters or engineered wetlands to remove pollutants. For 
instance, a study has shown how reed beds can be used for water filtration
by using them as a natural treatment system for agricultural wastewater.

4. Sponge filtration:
   - Some marine sponges (e.g., sea sponge) have evolved sophisticated 
mechanisms to filter out particles and contaminants from the surrounding 
water. By studying these natural filtration processes, we can develop 
efficient and sustainable filtration systems that remove pollutants 
effectively. For example, a study has shown how sea sponges can be used as
a model for designing a low-cost, self-sustaining water purification 
system that uses solar energy to disinfect contaminated water.

5. Biodegradable materials:
   - Natural organisms have evolved biodegradable materials to protect 
themselves from predators or environmental threats. For instance, some 
insects produce natural pesticides (such as alkaloids) to defend 
themselves against predators. Similarly, we can develop biodegradable 
materials that release these natural pesticides in contaminated water 
sources, killing off contaminants and restoring the environment's health.
   - For example, a study has shown how bacterial cells can be used as a 
natural filtration system to remove contaminants from water sources by 
incorporating them into membranes that selectively remove organic 
pollutants.

In conclusion, biomimicry plays a vital role in water purification by 
using natural processes and organisms as inspiration for developing 
sustainable and efficient solutions. By studying the natural filtration 
mechanisms of microorganisms, plants, animals, and other living organisms,
we can design eco-friendly and cost-effective water purification systems 
that help protect our environment from contamination.

---

### ***[Help Support ANIMA](https://www.buymeacoffee.com/anima7b)***

---

- **KNOWN ISSUES**

  The model will sometimes respond to itself and continue the conversation taking both the user and AI roles. This is a known issue in the Mistral model but does not happen very often. 

## ๐Ÿ“œ License

This project is licensed under Artistic-2.0

**This model is for research purposes only and restricted from any commercial use**
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Severian__ANIMA-Phi-Neptune-Mistral-7B-v4)

|             Metric              |Value|
|---------------------------------|----:|
|Avg.                             |55.61|
|AI2 Reasoning Challenge (25-Shot)|55.46|
|HellaSwag (10-Shot)              |77.63|
|MMLU (5-Shot)                    |53.12|
|TruthfulQA (0-shot)              |59.01|
|Winogrande (5-shot)              |73.48|
|GSM8k (5-shot)                   |14.94|