Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct:2019:19243617.
doi: 10.1109/bibe.2019.00020. Epub 2019 Dec 26.

Nanopore Guided Assembly of Segmental Duplications near Telomeres

Affiliations

Nanopore Guided Assembly of Segmental Duplications near Telomeres

Eleni Adam et al. Proc IEEE Int Symp Bioinformatics Bioeng. 2019 Oct.

Abstract

Human subtelomere regions are highly enriched in large segmental duplications and structural variants, leading to many gaps and misassemblies in these regions. We develop a novel method, NPGREAT (NanoPore Guided REgional Assembly Tool), which combines Nanopore ultralong read datasets and short-read assemblies derived from 10x linked-reads to efficiently assemble these subtelomere regions into a single continuous sequence. We show that with the use of ultralong Nanopore reads as a guide, the highly accurate shorter linked-read sequence contigs are correctly oriented, ordered, spaced and extended. In the rare cases where a linked-read sequence contig contains inaccurately assembled segments, the use of Nanopore reads allows for detection and correction of this error. We tested NPGREAT on four representative subtelomeres of the NA12878 human genome (10p, 16p, 19q and 20p). The results demonstrate that the final computed assembly of each subtelomere is accurate and complete.

Keywords: genome assembly; linked reads sequencing; nanopore; segmental duplications; subtelomere.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The assembly algorithm. Input: The selected ultralong Nanopore reads (NReads), the selected short REXTAL contigs (RContigs) and the chromosome number (chrID). Output: The assembled sequence (sequence).
Figure 2.
Figure 2.
Definition of a gap or an overlap between two REXTAL contigs. The red color designates the border local alignments of the neighboring REXTAL contigs and the blue color shows the Nanopore read’s alignment with the two contigs. The green color and prime letters constitute REXTAL contigs extended in repeat-masked parts of Nanopore reads. The purple color defines the bridging region. (a) A gap between the two contigs. (b) An overlap between two contigs. (c) Possible overlap, further investigation required.
Figure 3.
Figure 3.
10p Subtelomere region comparison with REXTAL and HG38.
Figure 4.
Figure 4.
16p Subtelomere region comparison with REXTAL and HG38.

References

    1. Armanios M, Alder JK, Parry EM, Karim B, Strong MA, and Greider CW, “Short telomeres are sufficient to cause the degenerative defects associated with aging,” The American Journal of Human Genetics, vol. 85, no. 6, pp. 823–832, 2009. - PMC - PubMed
    1. Meier A et al., “Spreading of mammalian DNA‐damage response factors studied by ChIP-chip at damaged telomeres,” The EMBO journal, vol. 26, no. 11, pp. 2707–2718, 2007. - PMC - PubMed
    1. Davalos AR, Coppe J-P, Campisi J, and Desprez P-Y, “Senescent cells as a source of inflammatory factors for tumor progression,” Cancer and Metastasis Reviews, vol. 29, no. 2, pp. 273283, 2010. - PMC - PubMed
    1. Jaskelioff M et al., “Telomerase reactivation reverses tissue degeneration in aged telomerase-deficient mice,” Nature, vol. 469, no. 7328, p. 102, 2011. - PMC - PubMed
    1. Britt-Compton B, Rowson J, Locke M, Mackenzie I, Kipling D, and Baird DM, “Structural stability and chromosome-specific telomere length is governed by cis-acting determinants in humans,” Human molecular genetics, vol. 15, no. 5, pp. 725–733, 2006. - PubMed

LinkOut - more resources