Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Feb 5;190(5):E126-E136.
doi: 10.1503/cmaj.171151.

The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants

Affiliations

The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants

Miriam S Reuter et al. CMAJ. .

Abstract

Background: The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers.

Methods: Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant.

Results: Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants - associated with cancer, cardiac or neurodegenerative phenotypes - remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual.

Interpretation: Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care.

PubMed Disclaimer

Conflict of interest statement

Competing interests: Stephen Scherer serves on the Scientific Advisory Committees of Population Bio and Deep Genomics. Sherilyn Bell, Jo-Anne Herbrick, Jennifer Howe, Ann Joseph-George, Barbara Kellam, Chao Lu, Jeffrey MacDonald, Christian Marshall, Thomas Nalpathamkalam, Rohan Patel, Tara Paton, Giovanna Pellecchia, Sergio Pereira, Miriam Reuter, Stephen Scherer, Lisa Strug, Wilson Sung, Bhooma Thiruvahindrapuram, Susan Walker, Zhuozhi Wang, John Wei, Joe Whitney, Richard Wintle and Ryan Yuen have received grants from Genome Canada/Ontario Genomics; Canada Foundation for Innovation; McLaughlin Centre, University of Toronto; the Government of Ontario, Canadian Institutes of Health Research (CIHR); and the The Hospital for Sick Children Foundation during the conduct of the study. James Ellis, Matthew Hildebrandt, Hin Lee, Peter Pasceri and Wei Wei have received a grant from the McLaughlin Centre, University of Toronto. Daniele Merico is an employee of Deep Genomics. Brett Trost has received a postdoctoral fellowship from CIHR. No other competing interests were declared.

Figures

Figure 1:
Figure 1:
Personal Genome Project Canada (PGPC) workflow. Potential participants in the project must meet eligibility criteria and undergo an entrance examination. Consent is sought and reaffirmed at stages throughout the process. Research ethics board protocols and consents, and genome data files are available at www.personalgenomes.ca. For adjunct analyses, we also assayed 55 samples using high-resolution microarrays (Affymetrix Cytoscan-HD) (Appendix 1), and we generated induced pluripotent stem cell lines for 3 individuals (Figure S4, Appendix 1).
Figure 2:
Figure 2:
Analysis and interpretation of whole genome sequencing data. We analyzed data from WGS for variants in the nuclear and mitochondrial genome: single nucleotide variants (SNVs; alternate single bases), insertion/deletions (indels; small segments of DNA that are missing or replicated), structural variants (SVs; variations involving larger segments), including copy number variants (CNVs; deletions/losses or duplications/gains), as well as other rearrangements (inversions or translocations). We also analyzed the data for 391 variants in 14 pharmacogenes (Table S2, Appendix 3). *Two variants (in MUTYH and PCDH15) were both recessive pathogenic and dominant risk factors. Note: B = benign, CGD = Clinical Genomic Database, HGMD = Human Gene Mutation Database, P = pathogenic, LB = likely benign, LP = likely pathogenic, RF = risk factor, VUS = variant of uncertain significance, WGS = whole genome sequencing.
Figure 3:
Figure 3:
Decline in the number of novel variants as additional samples were analyzed. The burden of new variants drops with each sample analyzed, in particular the number of variants classified as benign/likely benign and uncertain significance. For each of the 55 genetically unrelated participants (excluding the child in a mother–father–child trio) in the Personal Genome Project Canada, single nucleotide variants, insertion/deletions and copy number variants that overlapped genes from the Clinical Genomic Database were classified as benign/likely benign, variant of uncertain significance or pathogenic/likely pathogenic according to American College of Medical Genetics and Genomics interpretation guidelines. We then performed 1 million simulations; in each simulation, we randomly assigned the order of the samples, and the number of variants found in sample i that were not found in any of the samples 1, 2, …, i − 1 was calculated for each i, 1 ≤ i ≤ 55. The lines indicate the number of new variants for each value of i (averaged over the million simulations), whereas the shaded areas represent ±1 standard deviation from the mean, for each of the 3 variant categories: benign (green), variant of uncertain significance (blue) and pathogenic (red).

Comment in

References

    1. Shendure J, Balasubramanian S, Church GM, et al. DNA sequencing at 40: past, present and future. Nature 2017;550:345–53. - PubMed
    1. Lionel AC, Costain G, Monfared N, et al. Improved diagnostic yield compared with targeted gene sequencing panels suggests a role for whole-genome sequencing as a first-tier genetic test. Genet Med 2017. August 3 [Epub ahead of print]. 10.1038/gim.2017.119. - DOI - PMC - PubMed
    1. Stavropoulos DJ, Merico D, Jobling R, et al. Whole genome sequencing expands diagnostic utility and improves clinical management in pediatric medicine. NPJ Genom Med 2016;1:pii:15012. - PMC - PubMed
    1. Jiang YH, Yuen RK, Jin X, et al. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. Am J Hum Genet 2013; 93:249–63. - PMC - PubMed
    1. Gilissen C, Hehir-Kwa JY, Thung DT, et al. Genome sequencing identifies major causes of severe intellectual disability. Nature 2014;511:344–7. - PubMed