Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 10;185(23):4409-4427.e18.
doi: 10.1016/j.cell.2022.10.009.

Genomic architecture of autism from comprehensive whole-genome sequence annotation

Brett Trost  1 Bhooma Thiruvahindrapuram  2 Ada J S Chan  1 Worrawat Engchuan  1 Edward J Higginbotham  1 Jennifer L Howe  2 Livia O Loureiro  1 Miriam S Reuter  3 Delnaz Roshandel  4 Joe Whitney  2 Mehdi Zarrei  1 Matthew Bookman  5 Cherith Somerville  6 Rulan Shaath  2 Mona Abdi  7 Elbay Aliyev  8 Rohan V Patel  2 Thomas Nalpathamkalam  2 Giovanna Pellecchia  2 Omar Hamdan  2 Gaganjot Kaur  2 Zhuozhi Wang  2 Jeffrey R MacDonald  2 John Wei  2 Wilson W L Sung  2 Sylvia Lamoureux  2 Ny Hoang  9 Thanuja Selvanayagam  10 Nicole Deflaux  5 Melissa Geng  11 Siavash Ghaffari  1 John Bates  5 Edwin J Young  12 Qiliang Ding  6 Carole Shum  1 Lia D'Abate  1 Clarrisa A Bradley  13 Annabel Rutherford  14 Vernie Aguda  2 Beverly Apresto  2 Nan Chen  2 Sachin Desai  2 Xiaoyan Du  2 Matthew L Y Fong  2 Sanjeev Pullenayegum  2 Kozue Samler  2 Ting Wang  2 Karen Ho  2 Tara Paton  2 Sergio L Pereira  2 Jo-Anne Herbrick  2 Richard F Wintle  2 Jonathan Fuerth  15 Juti Noppornpitak  15 Heather Ward  15 Patrick Magee  15 Ayman Al Baz  15 Usanthan Kajendirarajah  15 Sharvari Kapadia  15 Jim Vlasblom  15 Monica Valluri  15 Joseph Green  15 Vicki Seifer  16 Morgan Quirbach  16 Olivia Rennie  2 Elizabeth Kelley  17 Nina Masjedi  18 Catherine Lord  18 Michael J Szego  19 Ma'n H Zawati  20 Michael Lang  20 Lisa J Strug  21 Christian R Marshall  22 Gregory Costain  23 Kristina Calli  24 Alana Iaboni  25 Afiqah Yusuf  26 Patricia Ambrozewicz  27 Louise Gallagher  28 David G Amaral  29 Jessica Brian  30 Mayada Elsabbagh  26 Stelios Georgiades  31 Daniel S Messinger  32 Sally Ozonoff  29 Jonathan Sebat  33 Calvin Sjaarda  34 Isabel M Smith  35 Peter Szatmari  36 Lonnie Zwaigenbaum  37 Azadeh Kushki  38 Thomas W Frazier  39 Jacob A S Vorstman  40 Khalid A Fakhro  41 Bridget A Fernandez  42 M E Suzanne Lewis  24 Rosanna Weksberg  43 Marc Fiume  15 Ryan K C Yuen  11 Evdokia Anagnostou  30 Neal Sondheimer  44 David Glazer  5 Dean M Hartley  16 Stephen W Scherer  45
Affiliations

Genomic architecture of autism from comprehensive whole-genome sequence annotation

Brett Trost et al. Cell. .

Abstract

Fully understanding autism spectrum disorder (ASD) genetics requires whole-genome sequencing (WGS). We present the latest release of the Autism Speaks MSSNG resource, which includes WGS data from 5,100 individuals with ASD and 6,212 non-ASD parents and siblings (total n = 11,312). Examining a wide variety of genetic variants in MSSNG and the Simons Simplex Collection (SSC; n = 9,205), we identified ASD-associated rare variants in 718/5,100 individuals with ASD from MSSNG (14.1%) and 350/2,419 from SSC (14.5%). Considering genomic architecture, 52% were nuclear sequence-level variants, 46% were nuclear structural variants (including copy-number variants, inversions, large insertions, uniparental isodisomies, and tandem repeat expansions), and 2% were mitochondrial variants. Our study provides a guidebook for exploring genotype-phenotype correlations in families who carry ASD-associated rare variants and serves as an entry point to the expanded studies required to dissect the etiology in the ∼85% of the ASD population that remain idiopathic.

Keywords: autism spectrum disorder; copy-number variation; neurodevelopmental disorders; phenotype measures; polygenic risk scores; rare variants; structural variation; whole-genome sequencing.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests E. Anagnostou has received consultation fees from Roche, Quadrant, and Oron; grant funding from Roche; in-kind supports from AMO Pharma and CRR; editorial honoraria from Wiley; and book royalties from APPI and Springer. She co-holds a patent for the device Anxiety Meter (patent # US20160000365A1). S.W.S. is on the Scientific Advisory Committee of Population Bio and serves as a Highly Cited Academic Advisor for King Abdulaziz University, and intellectual property from aspects of his research held at The Hospital for Sick Children are licensed to Athena Diagnostics and Population Bio. These relationships did not influence data interpretation or presentation during this study but are disclosed for potential future considerations.

Figures

Figure 1.
Figure 1.. Data Processing and Analysis Workflow.
The Genetic Variant Data section applies to all samples except 1,738 sequenced by Complete Genomics, which used proprietary software. Although CNVs and TREs are types of SVs, they are shown separately because different methods were used for their detection. CNVs include deletions and duplications ≥1 kb, while SVs include deletions, duplications, insertions, and inversions ≥50 bp.
Figure 2.
Figure 2.. Identification of ASD-Associated Genes Using TADA+.
(A) Exonic de novo variants per individual in ASC, MSSNG, and SPARK. (B) ASD-associated genes discovered only in the previous TADA+ analysis (“ASC”), only in the current analysis (“ASC+MSSNG+SPARK”), or both. (C) Distribution of SFARI Gene scores for the newly discovered genes. (D) FDRs for the 134 ASD-associated genes. Blue dots: genes also identified in the previous TADA+ analysis; green dots: new genes. Genes with FDR calculated as zero were assigned a value of 10−17. The blue line represents FDR=0.1. (E) Evidence supporting the new ASD-associated genes. “Case-control difference” represents PTVs in cases minus controls. The y-axis is truncated at −3; the value for MIB1 is −9. (F) pLI values for the new genes. (G) Network diagram of TADA+ genes. Only genes connected to gene networks are shown. Nodes represent genes, and edges represent protein-protein and pathway interactions between gene pairs. Modules are indicated by blue circles, with module labels derived from GO term enrichment tests (bold: significantly enriched).
Figure 3.
Figure 3.. Examples of Pathogenic SVs.
(A-C) 71 bp de novo frameshift insertion in SYNGAP1 comprising 65 bp of mtDNA and a 6 bp microduplication. (A) Schematic of SYNGAP1, with the insertion indicated by a red arrow. (B) Alignment of the insertion-containing contig sequence assembled by Manta, the reference sequence, and the mtDNA sequence inserted into chromosome 6. The microduplication is indicated in bold. (C) IGV visualization. The colored portions of reads represent mismatched bases, allowing precise breakpoint identification. The read depth increase reflects the 6 bp microduplication. (D-E) 13.4 kb inversion overlapping SCN2A. (D) Sequence trace showing the 5’ and 3’ breakpoints. (E) IGV visualization. The dark and light blue lines indicate anomalously mapped read pairs exhibiting the signature of an inversion.
Figure 4.
Figure 4.. Genomic Architecture of Rare Variants in ASD.
(A) Burden analysis of sequence-level rare coding variants. Left, individuals with ASD versus non-ASD siblings. Middle, probands from MSSNG MPX families versus those from either SSC SPX families (having exactly one individual with ASD and at least one sibling without) or MSSNG non-MPX families (having exactly one individual with ASD and no siblings). Right, same as middle except siblings with ASD instead of probands. Burdens for other rare ASD-associated variants are given in Table S5A. Compared with sequence-level variants, many SV types had very high ORs, including chromosomal abnormalities (OR=4.9), genomic disorders (OR=8.3), and SVs impacting ASD genes (OR=24) (comparisons between MSSNG individuals with ASD and SSC non-ASD siblings). (B) Yield of ASD-associated rare variants (top) and stratified by variant type (bottom). “Multiple” indicates individuals with ASD-associated variants in more than one category. (C) Distributions of consensus phenotype measures for individuals with ASD having each type of ASD-associated rare variant, or no variant. *Nominally significant (p<0.05); **: significant after multiple testing correction.
Figure 5.
Figure 5.. Detailed View of the Genomic Architecture of Rare Variants in ASD.
(A) De novo or rare inherited (gnomAD allele frequency <10−4) PTVs and de novo DMis variants in autosomal genes identified by the TADA+ analysis or evaluated as definitive by EAGLE. For X-linked genes, we identified hemizygous PTVs in males with frequency <10−4 in constrained genes (pLI > 0.95) found in the Genomics England neurology and neurodevelopmental disorders panel. (B) Recessive events (PTV-PTV events in genes from the Genomics England panel). (C) Chromosomal abnormalities. (D) Genomic disorders. (E) Large or gene-rich CNVs other than genomic disorders. (F) SVs disrupting ASD-associated genes. (G) UPDs. (H) TREs identified previously. (I) Pathogenic mtDNA variants. CPEO, chronic progressive external ophthalmoplegia; MNGIE, mitochondrial neurogastrointestinal encephalomyopathy.
Figure 6.
Figure 6.. PRS Analysis.
(A) Reproducibility in ten pairs of monozygotic (MZ) twins from MSSNG. (B) PRS distributions in MSSNG, SSC, and 1000G. (C) Comparison between individuals with ASD and non-ASD siblings. (D) Over-transmission of polygenic risk from parents to children. (E) Odds ratio of individuals with ASD to those without in each PRS decile, relative to decile 1, in MSSNG and SSC combined. (F) Pedigrees showing the PRS for each individual in two large MPX families. (G) Association between PRS and the four consensus phenotype measures.

References

    1. American Psychiatric Association (2013). Diagnostic and Statistical Manual of Mental Disorders 5th ed. (American Psychiatric Publishing; ).
    1. Lord C, Brugha TS, Charman T, Cusack J, Dumas G, Frazier T, Jones EJH, Jones RM, Pickles A, State MW, et al. (2020). Autism spectrum disorder. Nat Rev Dis Primers 6, 5. 10.1038/s41572-019-0138-4. - DOI - PMC - PubMed
    1. Public Health Agency of Canada (2021). Autism Spectrum Disorder: Highlights from the 2019 Canadian Health Survey on Children and Youth (https://www.canada.ca/content/dam/phac-aspc/documents/services/publicati...).
    1. Centers for Disease Control and Prevention (2022). Autism and Developmental Disabilities Monitoring (ADDM) Network (https://www.cdc.gov/ncbddd/autism/addm.html). Centers for Disease Control and Prevention. https://www.cdc.gov/ncbddd/autism/addm.html.
    1. Doshi-Velez F, Ge Y, and Kohane I (2014). Comorbidity clusters in autism spectrum disorders: An electronic health record time-series analysis. Pediatrics 133. 10.1542/peds.2013-0819. - DOI - PMC - PubMed

Publication types