Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2010 Dec;9(5-6):405-15.
doi: 10.1093/bfgp/elq025. Epub 2011 Jan 6.

Detecting structural variations in the human genome using next generation sequencing

Affiliations
Review

Detecting structural variations in the human genome using next generation sequencing

Ruibin Xi et al. Brief Funct Genomics. 2010 Dec.

Abstract

Structural variations are widespread in the human genome and can serve as genetic markers in clinical and evolutionary studies. With the advances in the next-generation sequencing technology, recent methods allow for identification of structural variations with unprecedented resolution and accuracy. They also provide opportunities to discover variants that could not be detected on conventional microarray-based platforms, such as dosage-invariant chromosomal translocations and inversions. In this review, we will describe some of the sequencing-based algorithms for detection of structural variations and discuss the key issues in future development.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
CNV detection using single-end reads with a control (reference) genome. The triangles represent mapped read positions along the case and control genomes. Regions in which the tag counts are different with statistical significance are identified as potential CNVs.
Figure 2:
Figure 2:
Configurations of PEMs in various types of SVs. (A) Deletion. The paired-end read spans the breakpoint of a deletion. Thus, the mapped distance of the paired-end reads is significantly larger than the insert size. (B) Insertion. The paired-end reads spans an insertion, and the mapped distance significantly less than the insert size. (C) Inversion. The read pair encompasses one breakpoint of an inversion and the right end is mapped with incorrect orientation. (D) Tandem duplication. The read pair spans the middle breakpoint of a tandem duplication. The PEM will have correct orientation but with reverse order. (E) Intra-chromosomal translocation. Two read pairs span the two breakpoints of an intra-chromosomal translocation with one pair having a large mapped distance and the other having correct orientation but their ordering reversed. (F) Inter-chromosomal translocation. The two ends of the pair are mapped to different chromosomes. (G) One-end unmapped. The sequenced genome has a DNA segment that does not exist in the reference genome. One end of the pair is mappable but the other is not.
Figure 3:
Figure 3:
The insert size is larger than the duplicated segment in a tandem duplication. (A) The read pair spans the middle breakpoint. This could be misleading because one may mistakenly conclude that there is an insertion between the pair. (B) The read pair spans one of the duplicated segments.
Figure 4:
Figure 4:
Local distribution of mapped distances of the paired-end reads. (A) A homozygous deletion. The distribution is shifted to right, where the gray curve is the global distribution of the mapped distance. (B) A heterozygous deletion. The distribution is a mixture of two distributions and therefore is a bimodal distribution.
Figure 5:
Figure 5:
Split mapping. (A) The read that spans the breakpoint of a deletion can be split and mapped to two positions on the reference genome. (B) The read that contains a small insertion is mapped to the reference genome after removing the small insertion.

References

    1. Freeman JL, Perry GH, Feuk L, et al. Copy number variation: new insights in genome diversity. Genome Res. 2006;16:949–61. - PubMed
    1. International HapMap Consortium. A haplotype map of the human genome. Nature. 2005;437:1299–320. - PMC - PubMed
    1. Check E. Human genome: patchwork people. Nature. 2005;437:1084–6. - PubMed
    1. Conrad DF, Pinto D, Redon R, et al. Origins and functional impact of copy number variation in the human genome. Nature. 2010;464:704–12. - PMC - PubMed
    1. Buchanan JA, Scherer SW. Contemplating effects of genomic structural variation. Genet Med. 2008;10:639–47. - PubMed

Publication types

Substances