import streamlit as st
import streamlit.components.v1 as components
import py3Dmol
from rdkit import Chem
from rdkit.Chem import Draw
from rdkit.Chem import AllChem

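st.title('SMILES + RDKit + Py3DMOL :smiley:')


def show(smi, style='stick'):
    """Build a 3D conformer for the SMILES string and write a py3Dmol viewer to viz.html."""
    mol = Chem.MolFromSmiles(smi)
    mol = Chem.AddHs(mol)  # explicit hydrogens are needed for a sensible 3D geometry
    AllChem.EmbedMolecule(mol)
    AllChem.MMFFOptimizeMolecule(mol, maxIters=200)
    mblock = Chem.MolToMolBlock(mol)

    view = py3Dmol.view(width=400, height=400)
    view.addModel(mblock, 'mol')
    view.setStyle({style: {}})
    view.zoomTo()
    view.show()
    view.render()
    t = view.js()
    # Save the viewer's JavaScript so it can be embedded in the page as HTML.
    with open('viz.html', 'w', encoding='utf-8') as f:
        f.write(t.startjs)
        f.write(t.endjs)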

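compound_smiles = st.text_input('SMILES please', 'CC')
m = Chem.MolFromSmiles(compound_smiles)
if m is None:
  # MolFromSmiles returns None when the string cannot be parsed.
  st.error('Could not parse that SMILES string.')
  st.stop()

Draw.MolToFile(m, 'mol.png')

show(compound_smiles)
with open('viz.html', 'r', encoding='utf-8') as html_file:
  source_code = html_file.read()
c1, c2 = st.columns(2)  # st.beta_columns was removed in later Streamlit releases
with c1:
  st.write('Molecule :coffee:')
  st.image('mol.png')
with c2:
  components.html(source_code, height=400, width=400)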

################ Sidebar ####################
with st.sidebar.expander('Rule One (Atoms and Bonds)'):
  st.markdown('''
## Atoms
| If | Then |
|----|----|
| Non-aromatic atoms | Upper case letters |
| Aromatic atoms | Lower case letters |
| Atomic symbol has more than one letter | The second letter is lower case |
## Bonds
| Bond type | Bond symbol |
|---|---|
| Single | - |
| Double | = |
| Triple | # |
| Aromatic | : |
| Disconnected structures | . |
### Example:
CC 👉 A non-aromatic carbon attached to another non-aromatic carbon by a single bond.

🛑 A bond between two lower case atom symbols is *aromatic*.
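
The case rule above can be sketched in plain Python. This is only an illustration, not how RDKit parses SMILES: `classify_atoms` is a hypothetical helper that assumes unbracketed organic-subset atoms and simply skips bond symbols, digits, and parentheses.

```python
# Hypothetical helper for illustration; not part of RDKit.
AROMATIC = {"b", "c", "n", "o", "p", "s"}  # lower case organic-subset symbols
TWO_LETTER = {"Cl", "Br"}                  # two-letter organic-subset symbols


def classify_atoms(smiles):
    """Return (symbol, kind) pairs for the atoms in a simple SMILES string."""
    atoms = []
    i = 0
    while i < len(smiles):
        pair = smiles[i:i + 2]
        ch = smiles[i]
        if pair in TWO_LETTER:
            atoms.append((pair, "non-aromatic"))
            i += 2
        elif ch in AROMATIC:
            atoms.append((ch, "aromatic"))
            i += 1
        elif ch.isupper():
            atoms.append((ch, "non-aromatic"))
            i += 1
        else:
            i += 1  # bond symbol, ring digit, or parenthesis
    return atoms


print(classify_atoms("CCc1ccccc1"))
```

For `CCc1ccccc1` (ethylbenzene) it reports two non-aromatic carbons followed by six aromatic ones.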
''')

with st.sidebar.expander('Rule Two (Simple Chains)'):
  st.markdown('''
  ## Simple chains
  * Structures are hydrogen-suppressed: molecules are written without hydrogens.
  * If the user does not specify enough bonds, the system assumes that the remaining
  connections are satisfied by hydrogens.
  * The user can write bonds to hydrogen explicitly, but the interpreter will then assume that all hydrogens have been identified explicitly.

  Note:

  *Because SMILES allows entry of all elements in the periodic table
  and also uses hydrogen suppression, the user should be aware of two-letter element symbols
  that could be misinterpreted by the computer. For example, 'Sc' could be interpreted as a **sulfur**
  atom connected to an aromatic **carbon** by a single bond, or it could be the symbol for **scandium**.
  The SMILES interpreter gives priority to the interpretation of a single bond connecting a sulfur atom and an aromatic carbon.
  To identify scandium, the user should enter [Sc]*.
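
Hydrogen suppression can be sketched as simple valence bookkeeping. `implicit_hydrogens` below is a hypothetical helper, assuming an unbranched, uncharged chain of C, N, and O atoms with their usual valences (4, 3, 2); a real SMILES parser handles many more cases.

```python
# Hypothetical helper for illustration; not part of RDKit.
VALENCE = {"C": 4, "N": 3, "O": 2}
BOND_ORDER = {"-": 1, "=": 2, "#": 3}


def implicit_hydrogens(chain):
    """Return (symbol, implied hydrogen count) for each atom in a linear chain."""
    atoms, used = [], []
    pending = 1  # order of the bond leading to the next atom (single by default)
    for ch in chain:
        if ch in BOND_ORDER:
            pending = BOND_ORDER[ch]
        elif ch in VALENCE:
            if atoms:
                used[-1] += pending   # bond counts against the previous atom
                used.append(pending)  # and against the new atom
            else:
                used.append(0)
            atoms.append(ch)
            pending = 1
        else:
            raise ValueError(f"unsupported symbol: {ch}")
    return [(a, VALENCE[a] - u) for a, u in zip(atoms, used)]


print(implicit_hydrogens("CCO"))
```

For `CCO` (ethanol) the sketch assigns three, two, and one hydrogens to the C, C, and O atoms respectively.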
  ''')

with st.sidebar.expander('Rule Three (Branches)'):
  st.markdown('''
  ## Branches
  * A branch from a chain is specified by placing the SMILES symbol(s) for the branch in parentheses.
  * The string in parentheses is placed directly after the symbol for the atom to which it is connected.
  * If it is connected by a double or triple bond, the bond symbol immediately follows the left parenthesis.
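
The branch rules above can be illustrated with a small parenthesis matcher. `find_branches` is a hypothetical helper, assuming single-letter atom symbols and balanced parentheses; it reports which atom (counted from 1) each branch hangs from.

```python
# Hypothetical helper for illustration; not part of RDKit.
def find_branches(smiles):
    """Return (parent atom number, branch text) for each parenthesised branch."""
    branches = []
    open_stack = []  # (position of "(", number of the atom it attaches to)
    atom_count = 0
    current = None   # number of the atom the next symbol attaches to
    for pos, ch in enumerate(smiles):
        if ch.isalpha():
            atom_count += 1
            current = atom_count
        elif ch == "(":
            open_stack.append((pos, current))
        elif ch == ")":
            start, parent = open_stack.pop()
            branches.append((parent, smiles[start + 1:pos]))
            current = parent  # the chain continues from the branch point
    return branches


print(find_branches("CC(C)C(=O)O"))
```

In `CC(C)C(=O)O` the methyl branch hangs from atom 2, and the `=O` branch, whose bond symbol directly follows the left parenthesis, hangs from atom 4.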
  ''')

with st.sidebar.expander('Rule Four (Rings)'):
  st.markdown('''
  ## Rings
  * SMILES identifies ring structures by using numbers to mark the opening and closing ring atoms.
  For example, in C1CCCCC1, the first carbon has a number '1', which connects it by a single bond to the last carbon, which also has a number '1'.
  The resulting structure is cyclohexane. Chemicals that have multiple rings may be identified by using different numbers for each ring.
  * If a double, single, or aromatic bond is used for the ring closure, the bond symbol is placed before the ring closure number.
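
Ring-closure numbers can be paired up with a dictionary of currently open rings. `ring_closures` is a hypothetical helper, assuming single-letter atom symbols and single-digit ring numbers; atoms are counted from 1 in the order they appear.

```python
# Hypothetical helper for illustration; not part of RDKit.
def ring_closures(smiles):
    """Return (opening atom number, closing atom number) for each ring bond."""
    open_rings = {}  # ring digit -> number of the atom that opened it
    closures = []
    atom_count = 0
    for ch in smiles:
        if ch.isalpha():
            atom_count += 1
        elif ch.isdigit():
            if ch in open_rings:
                closures.append((open_rings.pop(ch), atom_count))
            else:
                open_rings[ch] = atom_count
    if open_rings:
        raise ValueError("unclosed ring number(s): " + ", ".join(sorted(open_rings)))
    return closures


print(ring_closures("C1CCCCC1"))
```

For cyclohexane (`C1CCCCC1`) the sketch pairs atom 1 with atom 6; for naphthalene (`c1ccc2ccccc2c1`) it finds two ring bonds, one per ring number.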
  ''')

with st.sidebar.expander('Rule Five (Charged atoms)'):
  st.markdown('''
  ## Charged atoms
  Charges on an atom can be used to override the knowledge regarding valence that is built into SMILES software.
  In the SMILES syntax that RDKit parses, a charged atom is written as the atom symbol followed by its charge, both enclosed in square brackets.
  The number of charges may be stated explicitly ([Fe+2]) or left implicit for a single charge ([O-]).
  ''')
st.sidebar.markdown('Original Author: José Manuel Nápoles ([@napoles3d](https://twitter.com/napoles3D)). Find the original app at https://share.streamlit.io/napoles-uach/st_smiles/main/smiles.py')
st.sidebar.write('Info about SMILES: https://archive.epa.gov/med/med_archive_03/web/html/smiles.html')