Relationship between insertion/deletion (indel) frequency of proteins and essentiality
- PMID: 17598914
- PMCID: PMC1925122
- DOI: 10.1186/1471-2105-8-227
Relationship between insertion/deletion (indel) frequency of proteins and essentiality
Abstract
Background: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein and human proteins to allow for selective targeting. In one example, an indel difference was targeted via large scale in-silico screening. This resulted in selective antibodies and small compounds which were capable of binding to the deletion-bearing essential pathogen protein without any cross-reactivity to the highly similar human protein. The objective of the current study was to investigate whether indels were found more frequently in essential than non-essential proteins.
Results: We have investigated three species, Bacillus subtilis, Escherichia coli, and Saccharomyces cerevisiae, for which high-quality protein essentiality data is available. Using these data, we demonstrated with t-test calculations that the mean indel frequencies in essential proteins were greater than that of non-essential proteins in the three proteomes. The abundance of indels in both types of proteins was also shown to be accurately modeled by the Weibull distribution. However, Receiver Operator Characteristic (ROC) curves showed that indel frequencies alone could not be used as a marker to accurately discriminate between essential and non-essential proteins in the three proteomes. Finally, we analyzed the protein interaction data available for S. cerevisiae and observed that indel-bearing proteins were involved in more interactions and had greater betweenness values within Protein Interaction Networks (PINs).
Conclusion: Overall, our findings demonstrated that indels were not randomly distributed across the studied proteomes and were likely to occur more often in essential proteins and those that were highly connected, indicating a possible role of sequence insertions and deletions in the regulation and modification of protein-protein interactions. Such observations will provide new insights into indel-based drug design using bioinformatics and cheminformatics tools.
Figures




Similar articles
-
Indel PDB: a database of structural insertions and deletions derived from sequence alignments of closely related proteins.BMC Bioinformatics. 2008 Jun 25;9:293. doi: 10.1186/1471-2105-9-293. BMC Bioinformatics. 2008. PMID: 18578882 Free PMC article.
-
Large-scale survey for potentially targetable indels in bacterial and protozoan proteins.Proteins. 2006 Feb 1;62(2):371-80. doi: 10.1002/prot.20631. Proteins. 2006. PMID: 16315289
-
Insertion-deletion polymorphisms in 3' regions of maize genes occur frequently and can be used as highly informative genetic markers.Plant Mol Biol. 2002 Mar-Apr;48(5-6):539-47. doi: 10.1023/a:1014841612043. Plant Mol Biol. 2002. PMID: 12004893
-
Small Insertions and Deletions Drive Genomic Plasticity during Adaptive Evolution of Yersinia pestis.Microbiol Spectr. 2022 Jun 29;10(3):e0224221. doi: 10.1128/spectrum.02242-21. Epub 2022 Apr 19. Microbiol Spectr. 2022. PMID: 35438532 Free PMC article. Review.
-
Using high-throughput sequencing transcriptome data for INDEL detection: challenges for cancer drug discovery.Expert Opin Drug Discov. 2016;11(3):257-68. doi: 10.1517/17460441.2016.1143813. Epub 2016 Feb 6. Expert Opin Drug Discov. 2016. PMID: 26787005 Review.
Cited by
-
Extensive libraries of gene truncation variants generated by in vitro transposition.Nucleic Acids Res. 2017 Jun 2;45(10):e78. doi: 10.1093/nar/gkx030. Nucleic Acids Res. 2017. PMID: 28130425 Free PMC article.
-
Identifying overlapping mutated driver pathways by constructing gene networks in cancer.BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S3. doi: 10.1186/1471-2105-16-S5-S3. Epub 2015 Mar 18. BMC Bioinformatics. 2015. PMID: 25859819 Free PMC article.
-
Built-in loops allow versatility in domain-domain interactions: lessons from self-interacting domains.Proc Natl Acad Sci U S A. 2008 Sep 9;105(36):13292-7. doi: 10.1073/pnas.0801207105. Epub 2008 Aug 29. Proc Natl Acad Sci U S A. 2008. PMID: 18757736 Free PMC article.
-
Different Ways of Doing the Same: Variations in the Two Last Steps of the Purine Biosynthetic Pathway in Prokaryotes.Genome Biol Evol. 2019 Apr 1;11(4):1235-1249. doi: 10.1093/gbe/evz035. Genome Biol Evol. 2019. PMID: 30785193 Free PMC article.
-
A Probabilistic Model for Indel Evolution: Differentiating Insertions from Deletions.Mol Biol Evol. 2021 Dec 9;38(12):5769-5781. doi: 10.1093/molbev/msab266. Mol Biol Evol. 2021. PMID: 34469521 Free PMC article.
References
-
- Fraser CM, Gocayne JD, White O, Adams MD, Clayton RA, Fleischmann RD, Bult CJ, Kerlavage AR, Sutton G, Kelley JM, Fritchman RD, Weidman JF, Small KV, Sandusky M, Fuhrmann J, Nguyen D, Utterback TR, Saudek DM, Phillips CA, Merrick JM, Tomb JF, Dougherty BA, Bott KF, Hu PC, Lucier TS, Peterson SN, Smith HO, Hutchison CA, 3, Venter JC. The minimum gene complement of Mycoplasma genitalium. Science. 1995;270:397–403. doi: 10.1126/science.270.5235.397. - DOI - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases