jannisborn commited on
Commit
d2275f6
1 Parent(s): c351b1e
Files changed (1) hide show
  1. model_cards/article.md +22 -55
model_cards/article.md CHANGED
@@ -1,68 +1,35 @@
1
  # Supported molecular properties
2
 
3
 
4
- ### ClinTox
5
- A [ToxSmi model](https://github.com/PaccMann/toxsmi) trained on [ClinTox](https://moleculenet.org/datasets-1) dataset which has two endpoints: Probability of FDA approval and Probability of failure in clinical trials. When using this model, please cite *Born et al. (2023)* (citation below).
6
 
7
- ### SIDER
8
- A [ToxSmi model](https://github.com/PaccMann/toxsmi) trained on the [SIDER](https://moleculenet.org/datasets-1) dataset for 27 different types of side effects of drugs. When using this model, please cite *Born et al. (2023)* (citation below).
9
 
10
- ### Tox21
11
- A [ToxSmi model](https://github.com/PaccMann/toxsmi) trained on the [Tox21](https://tripod.nih.gov/tox/) dataset with 12 different types of environmental toxicities. When using this model, please cite *Born et al. (2023)* (citation below).
12
 
13
- ### SCScore
14
- Predict the synthetic complexity score (SCScore) as presented in [Coley et al. (*J. Chem. Inf. Model.*; 2018)](https://pubs.acs.org/doi/full/10.1021/acs.jcim.7b00622).
15
 
16
- ### SAS
17
- Estimate the synthetic accessibility score (SAS) as presented in [Ertl et al. (*Journal of Chemoinformatics*; 2009)](https://jcheminf.biomedcentral.com/articles/10.1186/1758-2946-1-8).
18
 
19
- ### Lipinski
20
- Measure whether a molecule confirms to the Lipinski-rule-of-five as presented in [Lipinski et al. (*Advanced Drug Delivery Reviews*; 2001)](https://www.sciencedirect.com/science/article/abs/pii/S0169409X00001290?via%3Dihub).
21
 
22
- ### Penalized logP
23
- Measure the penalized logP (partition coefficient) score as presented in [Gomez-Bombarelli et al. (*ACS Central Science*; 2018)](https://arxiv.org/abs/1610.02415v1). This is the logP minus the number of rings with > 6 atoms minus the SAS.
24
 
25
- ### QED
26
- Measure the drug-likeness as presented in [Bickerton et al. (*Nature Chemistry*; 2012)](https://www.nature.com/articles/nchem.1243).
27
 
28
- ### LogP
29
- Measure the logP (partition coefficient) of a molecule as presented in [Wildman et al. (*J. Chem. Inf. Comput. Sci.*; 1999)](https://pubs.acs.org/doi/full/10.1021/ci990307l).
30
 
31
- ### Bertz
32
- Calculate the total polar surface area of a molecule as presented in [Ertl et al. (*Journal of Medicinal Chemistry*; 2000)](https://pubs.acs.org/doi/full/10.1021/jm000942e).
33
 
34
- ### TPSA
35
- Calculate the first general index of molecular complexity [Bertz (*Journal of the American Chemical Society*; 1981)](https://pubs.acs.org/doi/pdf/10.1021/ja00402a071).
36
-
37
- ### Is-Scaffold
38
- Whether the molecule is identical to its [Murcko scaffold](https://rdkit.org/docs/source/rdkit.Chem.Scaffolds.MurckoScaffold.html).
39
-
40
- ### Number-Of-X
41
- Calculated with [RDKit](https://www.rdkit.org/docs/source/rdkit.Chem.rdchem.html).
42
-
43
- ### Molecular Weight
44
- Calculated with [RDKit](https://www.rdkit.org/docs/source/rdkit.Chem.rdchem.html).
45
-
46
-
47
- ### ToxSmi citation
48
- ```bib
49
- @article{born2023chemical,
50
- title={Chemical representation learning for toxicity prediction},
51
- author={Born, Jannis and Markert, Greta and Janakarajan, Nikita and Kimber, Talia B. and Volkamer, Andrea and Rodriguez Martinez, Maria and Manica, Matteo},
52
- journal={Under review at Digital Discovery},
53
- year={2023}
54
- }
55
- ```
56
-
57
-
58
- ### Unsupported properties
59
- The following molecular properties are available via the GT4SD API but not in this UI:
60
- - [MoleculeOne](https://tdcommons.ai/functions/oracles/#moleculeone) endpoint for retrosynthesis
61
- - [ASKCOS](https://tdcommons.ai/functions/oracles/#askcos) endpoint for retrosynthesis
62
- - [TDC-Docking](https://tdcommons.ai/functions/oracles/#docking-scores) endpoint for docking against a user-provided target
63
- - [TDC-Docking](https://tdcommons.ai/functions/oracles/#docking-scores) endpoint for docking against *3pbl*.
64
- - [Protein-ligand binding](https://tdcommons.ai/functions/oracles/#dopamine-receptor-d2-drd2) against one of the targets *drd2*, *gsk3b*, *jnk3*, *fpscores*, *cyp3a4_veith*, *drd2_current*, *gsk3b_current* or *jnk3_current*.
65
- - [Tanimoto similarity](https://tdcommons.ai/functions/oracles/#similaritydissimilarity) to a seed molecule.
66
-
67
-
68
- Moreover, GT4SD also includes properties on other entities such as [proteins](https://gt4sd.github.io/gt4sd-core/api/gt4sd.properties.proteins.html) and [crystals](https://gt4sd.github.io/gt4sd-core/api/gt4sd.properties.crystals.html).
 
1
  # Supported molecular properties
2
 
3
 
4
+ ### Instability
5
+ Compute the protein instability as presented in [Guruprasad et al. (*Protein Engineering, Design and Selection*; 1990)](https://academic.oup.com/peds/article-abstract/4/2/155/1491271).
6
 
7
+ ### Aromaticity
8
+ Compute the protein aromaticity as presented in [Lobry et al. (*Nucleic Acid Research*; 1994)](https://academic.oup.com/nar/article-abstract/22/15/3174/1087817).
9
 
10
+ ### Isoelectric point
11
+ Computes the isoelectric point of every residue and aggregates.
12
 
13
+ ### Hydrophobicity
14
+ "Computes the hydrophobicity of a protein, relative freq. of **A,C,F,I,L,M & V**.
15
 
16
+ ### Aliphaticity
17
+ Compute the aliphatic index of globular proteins as presented in [Ikai (*The Journal of Biochemistry*; 1980)](https://academic.oup.com/jb/article-abstract/88/6/1895/773432).
18
 
19
+ ### Charge
20
+ Compute the charge of a protein, based on a boolean for the amide (whether the sequences are C-terminally amidated) and a pH value; as presented in [Bjellqvist, (*Electrophoresis*; 1993)](https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/abs/10.1002/elps.11501401163).
21
 
22
+ ### Charge Density
23
+ Computes the charge density of a protein, based on a boolean for the amide (whether the sequences are C-terminally amidated) and a pH value; as presented in [Bjellqvist, (*Electrophoresis*; 1993)](https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/abs/10.1002/elps.11501401163).
24
 
25
+ ### Boman index
26
+ Compute the protein aromaticity as presented in [Boman (*Journal of internal medicine*; 2003)](https://onlinelibrary.wiley.com/doi/full/10.1046/j.1365-2796.2003.01228.x).
27
 
28
+ ### Protein weight
29
+ Compute the molecular weight of a protein with [RDKit](https://www.rdkit.org/docs/GettingStartedInPython.html).
30
 
31
+ ### Length
32
+ Retrieves the number of residues of a protein.
33
 
34
+ Moreover, GT4SD also includes properties on other entities such as [molecules](https://gt4sd.github.io/gt4sd-core/api/gt4sd.properties.molecules.html) and [crystals](https://gt4sd.github.io/gt4sd-core/api/gt4sd.properties.crystals.html).
35
+ The GT4SD web app for molecules can be found [here](https://huggingface.co/spaces/GT4SD/molecular_properties)