Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 10;12(2):320.
doi: 10.3390/plants12020320.

Structural Refinement by Direct Mapping Reveals Assembly Inconsistencies near Hi-C Junctions

Affiliations

Structural Refinement by Direct Mapping Reveals Assembly Inconsistencies near Hi-C Junctions

Luca Marcolungo et al. Plants (Basel). .

Abstract

High-throughput chromosome conformation capture (Hi-C) is widely used for scaffolding in de novo assembly because it produces highly contiguous genomes, but its indirect statistical approach can introduce connection errors. We employed optical mapping (Bionano Genomics) as an orthogonal scaffolding technology to assess the structural solidity of Hi-C reconstructed scaffolds. Optical maps were used to assess the correctness of five de novo genome assemblies based on long-read sequencing for contig generation and Hi-C for scaffolding. Hundreds of inconsistencies were found between the reconstructions generated using the Hi-C and optical mapping approaches. Manual inspection, exploiting raw long-read sequencing data and optical maps, confirmed that several of these conflicts were derived from Hi-C joining errors. Such misjoins were widespread, involved the connection of both small and large contigs, and even overlapped annotated genes. We conclude that the integration of optical mapping data after, not before, Hi-C-based scaffolding, improves the quality of the assembly and limits reconstruction errors by highlighting misjoins that can then be subjected to further investigation.

Keywords: Hi-C; assembly refinement; de novo genome assembly; optical mapping.

PubMed Disclaimer

Conflict of interest statement

Author E.C. is an employee of Genartis srl. Authors M.R. and M.D. are partners of Genartis srl. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Example of inconsistency between optical maps and Hi-C-derived scaffolds in the H. lacustris assembly. (A) Alignment to the NGS assembly scaffolded with Hi-C. Red and green arrows indicate the incongruence points in the BspQI and the BssSI maps, respectively. (B) Raw Bionano molecules aligned on the two maps involved in the conflict.
Figure 2
Figure 2
Integrative Genome Browser of two junction regions identified as potential errors by optical maps. ONT data mapped to the Hi-C-scaffolded H. lacustris genome show either (A) no reads spanning the contig junction or (B) reads spanning the junction but highlighting the incorrect gap size.
Figure 3
Figure 3
Integrative Genome Browser, visualisation of long PacBio, reads aligned on an OM/Hi-C conflict identified on the L. albus genome assemblies. (A) PacBio reads aligned on the PacBio + Hi-C + OM (after anchoring) assembly support the OM-based reconstruction. (B) PacBio reads aligned on the PacBio + Hi-C assembly do not support the HiC-based reconstruction.

References

    1. Lee J.Y., Kong M., Oh J., Lim J.S., Chung S.H., Kim J.M., Kim J.S., Kim K.H., Yoo J.C., Kwak W. Comparative Evaluation of Nanopore Polishing Tools for Microbial Genome Assembly and Polishing Strategies for Downstream Analysis. Sci. Rep. 2021;11:20740. doi: 10.1038/s41598-021-00178-w. - DOI - PMC - PubMed
    1. Wick R.R., Holt K.E. Polypolish: Short-read polishing of long-read bacterial genome assemblies. PLoS Comput. Biol. 2022;18:e1009802. doi: 10.1371/journal.pcbi.1009802. - DOI - PMC - PubMed
    1. Xue Z., Chen-guang L., Shi-hui Y., Xia W., Feng-wu B., Zhuo W. Benchmarking of long-read sequencing, assemblers and polishers for yeast genome. Brief. Bioinform. 2022;23:bbac146. - PubMed
    1. Chen Z., Erickson D.L., Meng J. Polishing the Oxford nanopore long-read assemblies of bacterial pathogens with illumina short reads to improve genomic analyses. Genomics. 2021;113:1366–1377. doi: 10.1016/j.ygeno.2021.03.018. - DOI - PubMed
    1. Liu J., Seetharam A.S., Chougule K., Ou S., Swentowsky K.W., Gent J.I., Llaca V., Woodhouse M.R., Manchanda N., Presting G.G., et al. Gapless assembly of maize chromosomes using long-read technologies. Genome Biol. 2020;21:121. doi: 10.1186/s13059-020-02029-9. - DOI - PMC - PubMed

LinkOut - more resources