Protein domain recurrence and order can enhance prediction of protein functions

M.A. Messih, M. Chitale, V.B. Bajic, D. Kihara, X. Gao
Bioinformatics, 28(18):i444-i450, (2012)

Protein domain recurrence and order can enhance prediction of protein functions

Keywords

Protein functions

Abstract

Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference.                  

Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the naïve Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions.

Code

DOI: 10.1093/bioinformatics/bts398

Sources

Website PDF

See all publications 2012