Domain2GO / pages /About.py
Erva Ulusoy
updated paper links
eba261e
import streamlit as st
st.sidebar.markdown('''
# Sections
- [How it works](#how-it-works)
- [Schematic overview of Domain2GO](#schematic-overview)
''', unsafe_allow_html=True)
st.markdown('''
# Mutual Annotation-Based Prediction of Protein Domain Functions with Domain2GO
''')
st.markdown(
"""
[![publication](https://img.shields.io/badge/DOI-10.1002/pro.4988-b31b1b.svg)](https://doi.org/10.1002/pro.4988) [![github-repository](https://img.shields.io/badge/GitHub-black?logo=github)](https://github.com/HUBioDataLab/Domain2GO)
""")
st.markdown('<p style="font-size:18px; font-weight:bold">Developers: Erva Ulusoy, Tunca Dogan</p>', unsafe_allow_html=True)
st.header('How it works', anchor='how-it-works')
# st.markdown(
# '<p style="font-size:25px; font-weight:bold">How it works</p>', unsafe_allow_html=True)
st.markdown(
"""
Domain2GO mappings are a set of association predictions between protein domains and Gene Ontology (GO) terms. This tool provides a user-friendly interface to predict protein functions by propagating Domain2GO mappings to query proteins that are annotated with those domains. This operation is shown in [panel (F) of the figure below](#schematic-overview).
Domain2GO is developed with the aim of identifying unknown protein functions by associating domains with Gene Ontology terms, thus defining the problem as domain function prediction. Domain2GO mappings are generated using the existing domain and GO annotation data. In order to obtain highly reliable associations, we employed statistical resampling and analyzed the co-occurrence patterns of domains and GO terms on the same proteins.
We applied Domain2GO to predict protein functions, by propagating domain-associated GO terms to proteins that are annotated with those domains. For protein function prediction performance evaluation and comparison against other methods, we employed CAFA3 challenge datasets. The results demonstrated the potential of Domain2GO, especially when predicting molecular function and biological process terms, as it performed better than baseline predictors and curated associations (Fmax = 0.48 and 0.36 for MFO and BPO, respectively).
For more information on the construction of Domain2GO mappings, statistical analysis of mappings, calculation of probability scores and protein function prediction performance evaluation, please refer to our publication. If you use Domain2GO in your research, please cite the following paper:
Ulusoy E, Doğan T. Mutual annotation-based prediction of protein domain functions with Domain2GO. Protein Science. 2024; 33(6): e4988. [Link](https://doi.org/10.1002/pro.4988)
Overall workflow of Domain2GO is shown below.
""")
st.header('Schematic overview of Domain2GO', anchor='schematic-overview')
st.image('figures/full_methodology.png', width=700)
st.markdown(
'<p style="text-align:center"><em><strong>Schematic representation of the proposed method. (A)</strong> The source datasets were downloaded and organized; <strong>(B)</strong> initial mapping between the InterPro domains and GO terms were obtained, and the mapping parameters were calculated; <strong>(C)</strong> generation of the randomized annotation and mapping sets were constructed; <strong>(D)</strong> co-occurrence similarity distributions were plotted, and thresholds were selected based on statistical resampling; <strong>(E)</strong> an ablation study was conducted by calculating the enrichment of top predictions ranked by different statistical measures and finalized Domain2GO mappings were generated by filtering initial mappings; <strong>(F)</strong> protein function predictions were generated by propagating Domain2GO mappings to target proteins.</em></p>',
unsafe_allow_html=True)