Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 15;31(16):2741-4.
doi: 10.1093/bioinformatics/btv204. Epub 2015 Apr 10.

MetaSV: an accurate and integrative structural-variant caller for next generation sequencing

Affiliations

MetaSV: an accurate and integrative structural-variant caller for next generation sequencing

Marghoob Mohiyuddin et al. Bioinformatics. .

Abstract

Structural variations (SVs) are large genomic rearrangements that vary significantly in size, making them challenging to detect with the relatively short reads from next-generation sequencing (NGS). Different SV detection methods have been developed; however, each is limited to specific kinds of SVs with varying accuracy and resolution. Previous works have attempted to combine different methods, but they still suffer from poor accuracy particularly for insertions. We propose MetaSV, an integrated SV caller which leverages multiple orthogonal SV signals for high accuracy and resolution. MetaSV proceeds by merging SVs from multiple tools for all types of SVs. It also analyzes soft-clipped reads from alignment to detect insertions accurately since existing tools underestimate insertion SVs. Local assembly in combination with dynamic programming is used to improve breakpoint resolution. Paired-end and coverage information is used to predict SV genotypes. Using simulation and experimental data, we demonstrate the effectiveness of MetaSV across various SV types and sizes.

Availability and implementation: Code in Python is at http://bioinform.github.io/metasv/.

Contact: rd@bina.com

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
High-level view of the MetaSV methodology
Fig. 2.
Fig. 2.
Accuracy comparisons for deletions and insertions. Accuracy metrics are shown on a per size bin basis in the plots. The tables below the plots show the aggregate accuracy scores. If a tool does not support detecting the SV type, an NA is indicated in the table. Each tool name is color coded to match the color code in the plots. DELLY’s suboptimal deletion performance was due to its lower breakpoint resolution. For insertions, although Pindel’s sensitivity was close to MetaSV, it had a significantly lower precision and overall accuracy

References

    1. Abyzov A., Gerstein M. (2011) AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision. Bioinformatics, 27, 595–603. - PMC - PubMed
    1. Abyzov A., et al. . (2011) CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res., 21, 974–984. - PMC - PubMed
    1. Abyzov A., et al. . (2015) Analysis of deletion breakpoints from 1,092 humans reveals details of mutation mechanisms. Nat. Commun., 6, 7256, doi: 10.1038/ncomms8256. - PMC - PubMed
    1. Bankevich A., et al. . (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol., 19, 455–477. - PMC - PubMed
    1. Chen K., et al. . (2009) BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat. Methods, 6, 677–681. - PMC - PubMed

Publication types