A multi-objective optimization approach accurately resolves protein domain architectures
- PMID: 26458889
- PMCID: PMC4734041
- DOI: 10.1093/bioinformatics/btv582
Abstract
Motivation: Given a protein sequence and a number of potential domains matching it, what are the domain content and the most likely domain architecture for the sequence? This problem is of fundamental importance in protein annotation, constituting one of the main steps of all predictive annotation strategies. However, when several potential domains conflict because their boundaries overlap, finding a solution can become difficult. An accurate prediction of the domain architecture of a multi-domain protein provides important information for function prediction, comparative genomics and molecular evolution.
Results: We developed DAMA (Domain Annotation by a Multi-objective Approach), a novel approach that identifies architectures through a multi-objective optimization algorithm combining scores of domain matches, previously observed multi-domain co-occurrence and domain overlap. DAMA has been validated on a known benchmark dataset based on CATH structural domain assignments and on the set of Plasmodium falciparum proteins. On both datasets, it outperforms the existing tools it was compared with.
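To make the idea of combining several criteria concrete, the following is a minimal Python sketch of a generic multi-objective (Pareto) selection over candidate architectures. It is not DAMA's algorithm, which the abstract does not detail. The `Hit` class, the `known_pairs` set, the `max_overlap` threshold, the brute-force enumeration and the toy coordinates and scores are all assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Hit:
    """A hypothetical domain match on a protein (1-based, inclusive coordinates)."""
    domain: str
    start: int
    end: int
    score: float


def overlap(a: Hit, b: Hit) -> int:
    """Number of residues shared by two hits (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def objectives(arch, known_pairs):
    """Objective vector to maximize: (match score, co-occurrence, -overlap)."""
    pairs = list(combinations(arch, 2))
    total_score = sum(h.score for h in arch)
    cooccurrence = sum(
        1 for a, b in pairs if frozenset((a.domain, b.domain)) in known_pairs
    )
    total_overlap = sum(overlap(a, b) for a, b in pairs)
    return (total_score, cooccurrence, -total_overlap)


def dominates(u, v):
    """True if u is at least as good as v on every objective and differs from v."""
    return all(x >= y for x, y in zip(u, v)) and u != v


def pareto_architectures(hits, known_pairs, max_overlap=15):
    """Enumerate hit subsets with bounded pairwise overlap; keep the Pareto front.

    Brute force is exponential in the number of hits and is only meant
    for toy inputs (an assumption of this sketch, not a claim about DAMA).
    """
    candidates = []
    for k in range(1, len(hits) + 1):
        for subset in combinations(hits, k):
            if all(overlap(a, b) <= max_overlap for a, b in combinations(subset, 2)):
                candidates.append((subset, objectives(subset, known_pairs)))
    return [
        (arch, obj)
        for arch, obj in candidates
        if not any(dominates(other, obj) for _, other in candidates)
    ]


# Toy input: three conflicting hits; A and B were "previously observed" together.
hits = [
    Hit("A", 1, 100, 50.0),
    Hit("B", 90, 200, 40.0),
    Hit("C", 95, 210, 45.0),
]
known_pairs = {frozenset(("A", "B"))}
front = pareto_architectures(hits, known_pairs)
front_names = {tuple(h.domain for h in arch) for arch, _ in front}
```

On this toy input, B and C overlap too much to coexist, so the sketch keeps three non-dominated candidates: A alone (no overlap), A with B (supported by co-occurrence), and A with C (highest summed score). A real tool would still need a rule for choosing one architecture from such a front, and a search strategy that scales beyond exhaustive enumeration.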
Availability and implementation: DAMA software is implemented in C++ and the source code can be found at http://www.lcqb.upmc.fr/DAMA.
Contact: juliana.silva_bernardes@upmc.fr or alessandra.carbone@lip6.fr
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.