Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Aug 24;7(1):9313.
doi: 10.1038/s41598-017-09287-x.

Effects of short indels on protein structure and function in human genomes

Affiliations

Effects of short indels on protein structure and function in human genomes

Maoxuan Lin et al. Sci Rep. .

Abstract

Insertions and deletions (indels) represent the second most common type of genetic variations in human genomes. Indels can be deleterious and contribute to disease susceptibility as recent genome sequencing projects revealed a large number of indels in various cancer types. In this study, we investigated the possible effects of small coding indels on protein structure and function, and the baseline characteristics of indels in 2504 individuals of 26 populations from the 1000 Genomes Project. We found that each population has a distinct pattern in genes with small indels. Frameshift (FS) indels are enriched in olfactory receptor activity while non-frameshift (NFS) indels are enriched in transcription-related proteins. Structural analysis of NFS indels revealed that they predominantly adopt coil or disordered conformations, especially in proteins with transcription-related NFS indels. These results suggest that the annotated coding indels from the 1000 Genomes Project, while contributing to genetic variations and phenotypic diversity, generally do not affect the core protein structures and have no deleterious effect on essential biological processes. In addition, we found that a number of reference genome annotations might need to be updated due to the high prevalence of annotated homozygous indels in the general population.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1
Indel size distribution.
Figure 2
Figure 2
Number of unique indels on each chromosome.
Figure 3
Figure 3
Relative positions of NFS (A) and FS (B) indels on proteins.
Figure 4
Figure 4
PCA analysis of indel patterns in 26 populations. (A) All indels; (B) Homozygous indels only
Figure 5
Figure 5
Gene enrichment analysis of genes with NFS or FS indels. (A) Significantly enriched categories in terms of Biological Process; (B) Significantly enriched categories in terms of Molecular Function.
Figure 6
Figure 6
PCA analysis of transcription-related indel patterns in 26 populations. (A) All indels; (B) Homozygous indels only
Figure 7
Figure 7
Secondary structure and residue disorder types for NFS indels. (A) Distribution of secondary structure types of all NFS indels; (B) Distribution of secondary structure types of transcription-related NFS indels; (C) Distribution of residue disorder of all NFS indels; (D) Distribution of residue disorder of transcription-related NFS indels. A residue in an indel is considered “disordered” or “ordered” if both IUPred and DisProt agree; otherwise it is annotated as “inconclusive”.
Figure 8
Figure 8
Comparisons of structural types between high and low allele frequency NFS indels. (A) Distribution of secondary structure types; (B) Distribution of residue disorder. A residue in an indel is considered “disordered” or “ordered” if both IUPred and DisProt predictions agree. Otherwise it is annotated as “Inconclusive”.

References

    1. Mullaney JM, Mills RE, Pittard WS, Devine SE. Small insertions and deletions (INDELs) in human genomes. Human molecular genetics. 2010;19:R131–136. doi: 10.1093/hmg/ddq400. - DOI - PMC - PubMed
    1. Brogna S, Wen J. Nonsense-mediated mRNA decay (NMD) mechanisms. Nature structural & molecular biology. 2009;16:107–113. doi: 10.1038/nsmb.1550. - DOI - PubMed
    1. Kloosterman WP, et al. Characteristics of de novo structural changes in the human genome. Genome Res. 2015;25:792–801. doi: 10.1101/gr.185041.114. - DOI - PMC - PubMed
    1. Bamshad MJ, et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet. 2011;12:745–755. doi: 10.1038/nrg3031. - DOI - PubMed
    1. Bermejo-Das-Neves C, Nguyen HN, Poch O, Thompson JD. A comprehensive study of small non-frameshift insertions/deletions in proteins and prediction of their phenotypic effects by a machine learning method (KD4i) BMC Bioinformatics. 2014;15 doi: 10.1186/1471-2105-15-111. - DOI - PMC - PubMed

Publication types

LinkOut - more resources