Towards a comprehensive structural variation map of an individual human genome
- PMID: 20482838
- PMCID: PMC2898065
- DOI: 10.1186/gb-2010-11-5-r52
Towards a comprehensive structural variation map of an individual human genome
Abstract
Background: Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions.
Results: We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association.
Conclusions: Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies.
Figures





Similar articles
-
Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.PLoS One. 2011;6(11):e27859. doi: 10.1371/journal.pone.0027859. Epub 2011 Nov 30. PLoS One. 2011. PMID: 22140474 Free PMC article.
-
Systematic inference of copy-number genotypes from personal genome sequencing data reveals extensive olfactory receptor gene content diversity.PLoS Comput Biol. 2010 Nov 11;6(11):e1000988. doi: 10.1371/journal.pcbi.1000988. PLoS Comput Biol. 2010. PMID: 21085617 Free PMC article.
-
The diploid genome sequence of an individual human.PLoS Biol. 2007 Sep 4;5(10):e254. doi: 10.1371/journal.pbio.0050254. PLoS Biol. 2007. PMID: 17803354 Free PMC article.
-
[DNA polymorphisms].Rinsho Byori. 2013 Nov;61(11):1001-7. Rinsho Byori. 2013. PMID: 24450105 Review. Japanese.
-
Strategies for the detection of copy number and other structural variants in the human genome.Hum Genomics. 2006 Jun;2(6):403-14. doi: 10.1186/1479-7364-2-6-403. Hum Genomics. 2006. PMID: 16848978 Free PMC article. Review.
Cited by
-
Utility of long-read sequencing for All of Us.Nat Commun. 2024 Jan 29;15(1):837. doi: 10.1038/s41467-024-44804-3. Nat Commun. 2024. PMID: 38281971 Free PMC article.
-
Copy number variation in the cattle genome.Funct Integr Genomics. 2012 Nov;12(4):609-24. doi: 10.1007/s10142-012-0289-9. Epub 2012 Jul 13. Funct Integr Genomics. 2012. PMID: 22790923 Review.
-
Genetic Risk Profiling in Parkinson's Disease and Utilizing Genetics to Gain Insight into Disease-Related Biological Pathways.Int J Mol Sci. 2020 Oct 4;21(19):7332. doi: 10.3390/ijms21197332. Int J Mol Sci. 2020. PMID: 33020390 Free PMC article. Review.
-
The impact of genomics on pediatric research and medicine.Pediatrics. 2012 Jun;129(6):1150-60. doi: 10.1542/peds.2011-3636. Epub 2012 May 7. Pediatrics. 2012. PMID: 22566424 Free PMC article. Review.
-
Human PTCHD3 nulls: rare copy number and sequence variants suggest a non-essential gene.BMC Med Genet. 2011 Mar 26;12:45. doi: 10.1186/1471-2350-12-45. BMC Med Genet. 2011. PMID: 21439084 Free PMC article.
References
-
- Levy S, Sutton G, Ng PC, Feuk L, Halpern AL, Walenz BP, Axelrod N, Huang J, Kirkness EF, Denisov G, Lin Y, MacDonald JR, Pang AW, Shago M, Stockwell TB, Tsiamouri A, Bafna V, Bansal V, Kravitz SA, Busam DA, Beeson KY, McIntosh TC, Remington KA, Abril JF, Gill J, Borman J, Rogers YH, Frazier ME, Scherer SW, Strausberg RL. et al.The diploid genome sequence of an individual human. PLoS Biol. 2007;5:e254. doi: 10.1371/journal.pbio.0050254. - DOI - PMC - PubMed
-
- Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, McGuire A, He W, Chen YJ, Makhijani V, Roth GT, Gomes X, Tartaro K, Niazi F, Turcotte CL, Irzyk GP, Lupski JR, Chinault C, Song XZ, Liu Y, Yuan Y, Nazareth L, Qin X, Muzny DM, Margulies M, Weinstock GM, Gibbs RA, Rothberg JM. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. doi: 10.1038/nature06884. - DOI - PubMed
-
- Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR, Boutell JM, Bryant J, Carter RJ, Keira Cheetham R, Cox AJ, Ellis DJ, Flatbush MR, Gormley NA, Humphray SJ, Irving LJ, Karbelashvili MS, Kirk SM, Li H, Liu X, Maisinger KS, Murray LJ, Obradovic B, Ost T, Parkinson ML, Pratt MR. et al.Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. doi: 10.1038/nature07517. - DOI - PMC - PubMed
-
- Wang J, Wang W, Li R, Li Y, Tian G, Goodman L, Fan W, Zhang J, Li J, Zhang J, Guo Y, Feng B, Li H, Lu Y, Fang X, Liang H, Du Z, Li D, Zhao Y, Hu Y, Yang Z, Zheng H, Hellmann I, Inouye M, Pool J, Yi X, Zhao J, Duan J, Zhou Y, Qin J. et al.The diploid genome sequence of an Asian individual. Nature. 2008;456:60–65. doi: 10.1038/nature07484. - DOI - PMC - PubMed
-
- Ley TJ, Mardis ER, Ding L, Fulton B, McLellan MD, Chen K, Dooling D, Dunford-Shore BH, McGrath S, Hickenbotham M, Cook L, Abbott R, Larson DE, Koboldt DC, Pohl C, Smith S, Hawkins A, Abbott S, Locke D, Hillier LW, Miner T, Fulton L, Magrini V, Wylie T, Glasscock J, Conyers J, Sander N, Shi X, Osborne JR, Minx P. et al.DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature. 2008;456:66–72. doi: 10.1038/nature07485. - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous