marinkaz commited on
Commit
43d4ff9
1 Parent(s): 8c8abf1

Update readme

Browse files
Files changed (1) hide show
  1. README.md +10 -5
README.md CHANGED
@@ -9,13 +9,17 @@ metrics:
9
  tags:
10
  - biology
11
  - chemistry
 
 
 
 
12
  library_name: tdc
13
  license: bsd-2-clause
14
  ---
15
 
16
  ## Dataset description
17
 
18
- An integrated Ether-a-go-go-related gene (hERG) dataset consisting of molecular structures labelled as hERG (<10uM) and non-hERG (>=10uM) blockers in the form of SMILES strings was obtained from the DeepHIT, the BindingDB database, ChEMBL bioactivity database, and other literature.
19
 
20
  ## Task description
21
  Binary classification. Given a drug SMILES string, predict whether it blocks (1, <10uM) or not blocks (0, >=10uM).
@@ -23,8 +27,8 @@ Binary classification. Given a drug SMILES string, predict whether it blocks (1,
23
  ## Dataset statistics
24
  Total: 13445; Train_val: 12620; Test: 825
25
 
26
- ## Dataset split:
27
- Random split on 70% training, 10% validation, and 20% testing
28
 
29
  To load the dataset in TDC, type
30
 
@@ -46,5 +50,6 @@ dp_model = tdc_hf.load_deeppurpose('./data')
46
  tdc_hf.predict_deeppurpose(dp_model, ['CC(=O)NC1=CC=C(O)C=C1'])
47
  ```
48
 
49
- ## References:
50
- [1] Karim, A., et al. CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles. J Cheminform 13, 60 (2021). https://doi.org/10.1186/s13321-021-00541-z
 
 
9
  tags:
10
  - biology
11
  - chemistry
12
+ - therapeutic science
13
+ - drug design
14
+ - drug development
15
+ - therapeutics
16
  library_name: tdc
17
  license: bsd-2-clause
18
  ---
19
 
20
  ## Dataset description
21
 
22
+ An integrated Ether-a-go-go-related gene (hERG) dataset consisting of molecular structures labeled as hERG (<10uM) and non-hERG (>=10uM) blockers in the form of SMILES strings was obtained from the DeepHIT, the BindingDB database, ChEMBL bioactivity database, and other literature.
23
 
24
  ## Task description
25
  Binary classification. Given a drug SMILES string, predict whether it blocks (1, <10uM) or not blocks (0, >=10uM).
 
27
  ## Dataset statistics
28
  Total: 13445; Train_val: 12620; Test: 825
29
 
30
+ ## Dataset split
31
+ Random split with 70% training, 10% validation, and 20% testing
32
 
33
  To load the dataset in TDC, type
34
 
 
50
  tdc_hf.predict_deeppurpose(dp_model, ['CC(=O)NC1=CC=C(O)C=C1'])
51
  ```
52
 
53
+ ## References
54
+ * Dataset entry in Therapeutics Data Commons, https://tdcommons.ai/single_pred_tasks/tox
55
+ * Karim, A., et al. CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles. J Cheminform 13, 60 (2021). https://doi.org/10.1186/s13321-021-00541-z