Prioritisation of associations between protein domains and complex diseases using domain-domain interaction networks
- PMID: 20500001
- DOI: 10.1049/iet-syb.2009.0037
Prioritisation of associations between protein domains and complex diseases using domain-domain interaction networks
Abstract
It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.
Similar articles
-
Architecture of basic building blocks in protein and domain structural interaction networks.Bioinformatics. 2005 Apr 15;21(8):1479-86. doi: 10.1093/bioinformatics/bti240. Epub 2004 Dec 21. Bioinformatics. 2005. PMID: 15613386
-
Functional evaluation of domain-domain interactions and human protein interaction networks.Bioinformatics. 2007 Apr 1;23(7):859-65. doi: 10.1093/bioinformatics/btm012. Bioinformatics. 2007. PMID: 17456608
-
PIBASE: a comprehensive database of structurally defined protein interfaces.Bioinformatics. 2005 May 1;21(9):1901-7. doi: 10.1093/bioinformatics/bti277. Epub 2005 Jan 18. Bioinformatics. 2005. PMID: 15657096
-
Computational prediction of protein-protein interactions.Methods Mol Biol. 2004;261:445-68. doi: 10.1385/1-59259-762-9:445. Methods Mol Biol. 2004. PMID: 15064475 Review.
-
Methods to reveal domain networks.Drug Discov Today. 2005 Aug 15;10(16):1111-7. doi: 10.1016/S1359-6446(05)03513-0. Drug Discov Today. 2005. PMID: 16182196 Review.
Cited by
-
Protein structural domain-disease association prediction based on heterogeneous networks.BMC Genomics. 2025 Apr 10;23(Suppl 6):869. doi: 10.1186/s12864-024-11117-0. BMC Genomics. 2025. PMID: 40211147 Free PMC article.
-
ProphNet: a generic prioritization method through propagation of information.BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-15-S1-S5. Epub 2014 Jan 10. BMC Bioinformatics. 2014. PMID: 24564336 Free PMC article.
-
Inference of domain-disease associations from domain-protein, protein-disease and disease-disease relationships.BMC Syst Biol. 2016 Jan 11;10 Suppl 1(Suppl 1):4. doi: 10.1186/s12918-015-0247-y. BMC Syst Biol. 2016. PMID: 26818594 Free PMC article.
-
DomainRBF: a Bayesian regression approach to the prioritization of candidate domains for complex diseases.BMC Syst Biol. 2011 Apr 19;5:55. doi: 10.1186/1752-0509-5-55. BMC Syst Biol. 2011. PMID: 21504591 Free PMC article.
-
Integrating multiple protein-protein interaction networks to prioritize disease genes: a Bayesian regression approach.BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S11. doi: 10.1186/1471-2105-12-S1-S11. BMC Bioinformatics. 2011. PMID: 21342540 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources