ZJU-Fangyin committed
Commit d24eb2c · 1 Parent(s): 1ef93e0

Update README.md

README.md CHANGED
@@ -8,7 +8,7 @@ tags:
 ---


-This repo contains a low-rank adapter for [LLaMA2-7b-chat](https://huggingface.co/meta-llama/Llama-2-7b-chat), trained on the


 Instructions for running it can be found at https://github.com/zjunlp/Mol-Instructions.
@@ -18,74 +18,74 @@ Instructions for running it can be found at https://github.com/zjunlp/Mol-Instru
 ![image.png](logo.png)

-<h3>
-

 <details>
-<summary><b>

-- *Please give me some details about this molecule:*
-[C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][=Branch1][C][=O][O][C@H1][Branch2][Ring1][=Branch1][C][O][C][=Branch1][C][=O][C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][C][O][P][=Branch1][C][=O][Branch1][C][O][O][C][C@@H1][Branch1][=Branch1][C][=Branch1][C][=O][O][N]
-
 ```
-
-It is functionally related to an arachidonic acid and an octadecanoic acid.
 ```
 </details>

 <details>
-<summary><b>
-
-- *Create a molecule with the structure as the one described:*
-The molecule is a primary arylamine in which an amino functional group is substituted for one of the benzene hydrogens. It is a primary arylamine and a member of anilines.

 ```
-
 ```
 </details>

 <details>
-<summary><b>

-- *
-
-
-
-
 ```
 </details>

 <details>
-<summary><b>
-
-- *Please suggest potential reactants used in the synthesis of the provided product:*
-[C][=C][C][C][N][C][=Branch1][C][=O][O][C][Branch1][C][C][Branch1][C][C][C]

 ```
-
 ```
 </details>


 <details>
-<summary><b>

-- *
-[C][C][=C][C][=C][Branch1][C][N][C][=N][Ring1][#Branch1].[O][=C][Branch1][C][Cl][C][Cl]>>[C][C][=C][C][=C][Branch1][Branch2][N][C][=Branch1][C][=O][C][Cl][C][=N][Ring1][O]

 ```
-
 ```
 </details>

 <details>
-<summary><b>

-- *
-[C][C][O][C][C][Branch1][C][C][C][Branch1][C][C][C]

 ```
-
 ```
 </details>
@@ -105,7 +105,7 @@ As illustrated in [our repository](https://github.com/zjunlp/Mol-Instructions/tr
 Please download [Llama-2-7b-chat](https://huggingface.co/meta-llama/Llama-2-7b-chat) to obtain the pre-training weights of LlamA-2-7b-chat, refine the `--base_model` to point towards the location where the model weights are saved.

-For model fine-tuned on **
@@ -8,7 +8,7 @@ tags:
 ---


+This repo contains a low-rank adapter for [LLaMA2-7b-chat](https://huggingface.co/meta-llama/Llama-2-7b-chat), trained on the 🥼 **biomolecule text instructions** from the 🧪 [Mol-Instructions](https://huggingface.co/datasets/zjunlp/Mol-Instructions) dataset.


 Instructions for running it can be found at https://github.com/zjunlp/Mol-Instructions.
@@ -18,74 +18,74 @@ Instructions for running it can be found at https://github.com/zjunlp/Mol-Instru

 ![image.png](logo.png)

+<h3> 🥼 Tasks</h3>

 <details>
+<summary><b>Chemical entity recognition</b></summary>
+
+- *Find and list all the instances of the chemical entities in the following content:*
+"Both the control and caramiphen groups with double cannulas had significantly shorter latencies to seizure onset than the corresponding groups with single cannula."

 ```
+caramiphen
 ```
 </details>

+
 <details>
+<summary><b>Chemical-disease interaction extraction</b></summary>

+- *You are provided with a set of clinical trial summaries. Extract the chemical-disease relations from the summaries and present your findings in the format of (Subject, Object):*
+"Eating disorders and the associated behavioural problems and drug abuse are uncommon in pregnancy. When they do occur they are often unrecognized because of denial but when significant may pose a risk to both the mother and her fetus. This case illustrates a number of problems that may be encountered in women with eating disorders in pregnancy, including prolonged and recurrent metabolic disturbances and diuretic abuse. In particular it illustrates the derangements of thyroid function seen in pregnant women with eating disorders and reminds us that when a cause for thyrotoxicosis remains obscure, thyroxine abuse should be considered and explored."
+
 ```
+(thyroxine, thyrotoxicosis)
 ```
 </details>

+
 <details>
+<summary><b>Chemical-protein interaction extraction</b></summary>

+- *Your task is to extract the subject and object entities from the provided text and establish the corresponding relation triples. Please format your answers as (Subject, Relation, Object):*
+"Opioid receptors are the membrane proteins that mediate the pain-relieving effect of opioid drugs, such as morphine and fentanyl as well as endogenous opioid peptides enkephalins and endorphins. Using cDNAs for the mu and the kappa opioid receptors, we mapped the chromosomal locations of their genes in mouse. Multilocus cross analysis located the mu receptor gene Oprm on Chr 10 and the kappa receptor gene Oprk1 on Chr 1. Both genes are near centromere, with no markers more centromeric. These data indicate that the two opioid receptors are different gene products, ruling out the possibility that they may be differential splicing products from the same gene."
+
+```
+(morphine, regulator, Opioid receptors), (fentanyl, regulator, Opioid receptors)
 ```
 </details>

+
 <details>
+<summary><b>Multiple-choice question</b></summary>

+- *Which of the following conditions does not show multifactorial inheritance? What of the following is the right choice?*
+(A) Pyloric stenosis (B) Schizophrenia (C) Spina bifida (neural tube defects) (D) Marfan syndrome
+
 ```
+The final answer is (D).
 ```
 </details>


 <details>
+<summary><b>True or False question</b></summary>

+- *Does feeding tube insertion and its timing improve survival?*

 ```
+No, neither insertion of PEG tubes nor timing of insertion affect survival.
 ```
 </details>

+
 <details>
+<summary><b>Open question</b></summary>

+- *What virus causes chronic disease?*

 ```
+Hepatitis C virus (HCV).
 ```
 </details>
@@ -105,7 +105,7 @@ As illustrated in [our repository](https://github.com/zjunlp/Mol-Instructions/tr

 Please download [Llama-2-7b-chat](https://huggingface.co/meta-llama/Llama-2-7b-chat) to obtain the pre-training weights of LlamA-2-7b-chat, refine the `--base_model` to point towards the location where the model weights are saved.

+For model fine-tuned on **biomolecular text** instructions, set `$FINETUNED_MODEL_PATH` to `'zjunlp/llama2-molinst-molecule-7b'`.
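
The two extraction tasks added in this commit ask for answers formatted as `(Subject, Object)` or `(Subject, Relation, Object)`. The following is a minimal sketch of how such a response could be turned into Python tuples. It assumes the model reproduces that parenthesised, comma-separated format exactly. `parse_relations` is a hypothetical helper written for illustration and is not part of the Mol-Instructions code.

```python
import re

# Matches one parenthesised group, e.g. "(morphine, regulator, Opioid receptors)".
# Assumption: entity names contain no parentheses of their own.
_GROUP_RE = re.compile(r"\(([^()]*)\)")


def parse_relations(response: str) -> list[tuple[str, ...]]:
    """Split a model response into tuples of whitespace-stripped fields.

    Hypothetical helper: it assumes fields are separated by commas and
    that no entity name contains a comma.
    """
    return [
        tuple(field.strip() for field in group.split(","))
        for group in _GROUP_RE.findall(response)
    ]


# Responses copied from the README examples in this commit.
triples = parse_relations(
    "(morphine, regulator, Opioid receptors), (fentanyl, regulator, Opioid receptors)"
)
pairs = parse_relations("(thyroxine, thyrotoxicosis)")
```

Entity names that themselves contain commas or parentheses (common in systematic chemical names) would be split incorrectly by this sketch, so a stricter parser would be needed for such outputs.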