Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome
- PMID: 28263316
- PMCID: PMC5909822
- DOI: 10.1038/ng.3802
Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome
Abstract
The decrease in sequencing cost and increased sophistication of assembly algorithms for short-read platforms has resulted in a sharp increase in the number of species with genome assemblies. However, these assemblies are highly fragmented, with many gaps, ambiguities, and errors, impeding downstream applications. We demonstrate current state of the art for de novo assembly using the domestic goat (Capra hircus) based on long reads for contig formation, short reads for consensus validation, and scaffolding by optical and chromatin interaction mapping. These combined technologies produced what is, to our knowledge, the most continuous de novo mammalian assembly to date, with chromosome-length scaffolds and only 649 gaps. Our assembly represents a ∼400-fold improvement in continuity due to properly assembled gaps, compared to the previously published C. hircus assembly, and better resolves repetitive structures longer than 1 kb, representing the largest repeat family and immune gene complex yet produced for an individual of a ruminant species.
Conflict of interest statement
Figures





Similar articles
-
Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity.Nat Commun. 2019 Jan 16;10(1):260. doi: 10.1038/s41467-018-08260-0. Nat Commun. 2019. PMID: 30651564 Free PMC article.
-
De novo assembly of a young Drosophila Y chromosome using single-molecule sequencing and chromatin conformation capture.PLoS Biol. 2018 Jul 30;16(7):e2006348. doi: 10.1371/journal.pbio.2006348. eCollection 2018 Jul. PLoS Biol. 2018. PMID: 30059545 Free PMC article.
-
Scaffolding of long read assemblies using long range contact information.BMC Genomics. 2017 Jul 12;18(1):527. doi: 10.1186/s12864-017-3879-z. BMC Genomics. 2017. PMID: 28701198 Free PMC article.
-
The present and future of de novo whole-genome assembly.Brief Bioinform. 2018 Jan 1;19(1):23-40. doi: 10.1093/bib/bbw096. Brief Bioinform. 2018. PMID: 27742661 Review.
-
PacBio Sequencing and Its Applications.Genomics Proteomics Bioinformatics. 2015 Oct;13(5):278-89. doi: 10.1016/j.gpb.2015.08.002. Epub 2015 Nov 2. Genomics Proteomics Bioinformatics. 2015. PMID: 26542840 Free PMC article. Review.
Cited by
-
Whole-genome resource sequences of 57 indigenous Ethiopian goats.Sci Data. 2024 Jan 29;11(1):139. doi: 10.1038/s41597-024-02973-2. Sci Data. 2024. PMID: 38287052 Free PMC article.
-
Genome-wide association study reveals novel candidate genes for litter size in Markhoz goats.Front Vet Sci. 2022 Nov 23;9:1045589. doi: 10.3389/fvets.2022.1045589. eCollection 2022. Front Vet Sci. 2022. PMID: 36504837 Free PMC article.
-
Database Resources of the BIG Data Center in 2019.Nucleic Acids Res. 2019 Jan 8;47(D1):D8-D14. doi: 10.1093/nar/gky993. Nucleic Acids Res. 2019. PMID: 30365034 Free PMC article.
-
Integrating Hi-C links with assembly graphs for chromosome-scale assembly.PLoS Comput Biol. 2019 Aug 21;15(8):e1007273. doi: 10.1371/journal.pcbi.1007273. eCollection 2019 Aug. PLoS Comput Biol. 2019. PMID: 31433799 Free PMC article.
-
Chromosome-scale genome assembly of Eustoma grandiflorum, the first complete genome sequence in the genus Eustoma.G3 (Bethesda). 2023 Feb 9;13(2):jkac329. doi: 10.1093/g3journal/jkac329. G3 (Bethesda). 2023. PMID: 36529465 Free PMC article.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources