bizhen committed on
Commit
00f9a19
1 Parent(s): 68aa19a

Update README.md

  1. README.md +8 -0
README.md CHANGED
@@ -1,3 +1,11 @@
  ---
  license: apache-2.0
  ---
+
+ license: apache-2.0
+ datasets:
+ - ProteinKG25
+
+
+ # OntoProtein
+ [OntoProtein](https://openreview.net/pdf?id=yfe1VMYAXa4) is the first general framework that incorporates the structure of GO (Gene Ontology) into protein pre-training models. We construct a novel large-scale knowledge graph that consists of GO and its related proteins; every node in the graph is described by either gene annotation text or a protein sequence. We propose a novel contrastive learning method with knowledge-aware negative sampling to jointly optimize the knowledge graph and protein embeddings during pre-training.