Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2020 Oct 1;11(1):4932.
doi: 10.1038/s41467-020-18723-y.

Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders

Tianyun Wang  1 Kendra Hoekzema  1 Davide Vecchio  2   3 Huidan Wu  4 Arvis Sulovari  1 Bradley P Coe  1 Madelyn A Gillentine  1 Amy B Wilfert  1 Luis A Perez-Jurado  5   6   7 Malin Kvarnung  8   9 Yoeri Sleyp  10 Rachel K Earl  11 Jill A Rosenfeld  12   13 Madeleine R Geisheker  1 Lin Han  4 Bing Du  4 Chris Barnett  5   14 Elizabeth Thompson  5 Marie Shaw  14 Renee Carroll  14 Kathryn Friend  15 Rachael Catford  15 Elizabeth E Palmer  16   17 Xiaobing Zou  18 Jianjun Ou  19 Honghui Li  20 Hui Guo  4 Jennifer Gerdts  11 Emanuela Avola  21 Giuseppe Calabrese  21 Maurizio Elia  21 Donatella Greco  21 Anna Lindstrand  8   9 Ann Nordgren  8   9 Britt-Marie Anderlid  8   9 Geert Vandeweyer  22 Anke Van Dijck  22 Nathalie Van der Aa  22 Brooke McKenna  23 Miroslava Hancarova  24 Sarka Bendova  24 Marketa Havlovicova  24 Giovanni Malerba  25 Bernardo Dalla Bernardina  26 Pierandrea Muglia  27 Arie van Haeringen  28 Mariette J V Hoffer  28 Barbara Franke  29   30 Gerarda Cappuccio  31   32 Martin Delatycki  33 Paul J Lockhart  33   34 Melanie A Manning  35   36 Pengfei Liu  12   13 Ingrid E Scheffer  33   37   38   39 Nicola Brunetti-Pierri  31   32 Nanda Rommelse  30   40 David G Amaral  41 Gijs W E Santen  28 Elisabetta Trabetti  25 Zdeněk Sedláček  24 Jacob J Michaelson  42 Karen Pierce  43 Eric Courchesne  43 R Frank Kooy  22 SPARK ConsortiumMagnus Nordenskjöld  8   9 Corrado Romano  21 Hilde Peeters  10 Raphael A Bernier  11 Jozef Gecz  6   14   15 Kun Xia  4   44 Evan E Eichler  45   46
Collaborators, Affiliations
Multicenter Study

Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders

Tianyun Wang et al. Nat Commun. .

Erratum in

  • Author Correction: Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders.
    Wang T, Hoekzema K, Vecchio D, Wu H, Sulovari A, Coe BP, Gillentine MA, Wilfert AB, Perez-Jurado LA, Kvarnung M, Sleyp Y, Earl RK, Rosenfeld JA, Geisheker MR, Han L, Du B, Barnett C, Thompson E, Shaw M, Carroll R, Friend K, Catford R, Palmer EE, Zou X, Ou J, Li H, Guo H, Gerdts J, Avola E, Calabrese G, Elia M, Greco D, Lindstrand A, Nordgren A, Anderlid BM, Vandeweyer G, Van Dijck A, Van der Aa N, McKenna B, Hancarova M, Bendova S, Havlovicova M, Malerba G, Bernardina BD, Muglia P, van Haeringen A, Hoffer MJV, Franke B, Cappuccio G, Delatycki M, Lockhart PJ, Manning MA, Liu P, Scheffer IE, Brunetti-Pierri N, Rommelse N, Amaral DG, Santen GWE, Trabetti E, Sedláček Z, Michaelson JJ, Pierce K, Courchesne E, Kooy RF; SPARK Consortium; Nordenskjöld M, Romano C, Peeters H, Bernier RA, Gecz J, Xia K, Eichler EE. Wang T, et al. Nat Commun. 2020 Oct 21;11(1):5398. doi: 10.1038/s41467-020-19289-5. Nat Commun. 2020. PMID: 33087701 Free PMC article.

Abstract

Most genes associated with neurodevelopmental disorders (NDDs) were identified with an excess of de novo mutations (DNMs) but the significance in case-control mutation burden analysis is unestablished. Here, we sequence 63 genes in 16,294 NDD cases and an additional 62 genes in 6,211 NDD cases. By combining these with published data, we assess a total of 125 genes in over 16,000 NDD cases and compare the mutation burden to nonpsychiatric controls from ExAC. We identify 48 genes (25 newly reported) showing significant burden of ultra-rare (MAF < 0.01%) gene-disruptive mutations (FDR 5%), six of which reach family-wise error rate (FWER) significance (p < 1.25E-06). Among these 125 targeted genes, we also reevaluate DNM excess in 17,426 NDD trios with 6,499 new autism trios. We identify 90 genes enriched for DNMs (FDR 5%; e.g., GABRG2 and UIMC1); of which, 61 reach FWER significance (p < 3.64E-07; e.g., CASZ1). In addition to doubling the number of patients for many NDD risk genes, we present phenotype-genotype correlations for seven risk genes (CTCF, HNRNPU, KCNQ3, ZBTB18, TCF12, SPEN, and LEO1) based on this large-scale targeted sequencing effort.

PubMed Disclaimer

Conflict of interest statement

E.E.E. is on the scientific advisory board (SAB) of DNAnexus, Inc. The Department of Molecular and Human Genetics at Baylor College of Medicine receives revenue from clinical genetic testing conducted at Baylor Genetics. The other authors have no competing interests to declare.

Figures

Fig. 1
Fig. 1. Overview of study design.
Targeted sequencing was performed in probands for two gene panels: NDD1 (63 genes) and hcNDD (62 genes). Gene and variant counts are after QC. The same categories of variants were retrieved from three previously published smMIP studies for 62 hcNDD genes. All smMIP variants were combined; redundant samples were eliminated and compared to the same category of variants from ExAC non-psych controls. The number of variants is after the exclusion of false positive variants and variants with insufficient coverage in ExAC. Mutation burden analysis identified 48 FDR significant genes (qmutBurden < 0.05, Benjamini–Hochberg correction for 125 genes), of which six reached FWER significance (pmutBurden < 1.25E−06, Bonferroni correction for 20,000 genes and two tests); DNMs of the 125 genes used in this study were identified from exome sequencing in 10,927 published NDD trios and 6,499 new ASD trios that combined as 17,426 NDD parent–child trios. A separate de novo enrichment analysis, using two statistical methods (CH model and denovolyzeR), identified 90 FDR significant genes (qdnEnrich < 0.05, Benjamini–Hochberg correction for 18,946 genes in CH model and 19,618 genes in denovolyzeR), of which, 61 genes reach FWER significance (pdnEnrich < 3.64E−07, Bonferroni correction for 19,618 genes and seven tests) for excess DNM. There is a significant overlap (40 genes) of the significant genes suggested by the two approaches. Then we performed genotype–phenotype correlation analysis for seven NDD risk genes (CTCF, HNRNPU, KCNQ3, ZBTB18, TCF12, SPEN, and LEO1) and present a clearer clinical picture of each gene.
Fig. 2
Fig. 2. Significant genes identified from mutation burden and de novo enrichment analyses.
a Mutation burden analysis identified 48 genes significant for LGD and/or MIS30 variants in smMIP sequencing compared with the ExAC (r0.3) non-psych subset controls; each dot indicates a gene and the color indicates the category of variant showing significance for the gene (red for LGD, blue for MIS30, and black for both LGD and MIS30). b The CH model and denovolyzeR show high concordance for genes with significant excess of DNM at both FDR and FWER levels. c A union set of 90 genes showing excess DNM (FDR 5%) in de novo enrichment analysis. Gray dashed box in top panel is shown in bottom panel for a zoom view. See Supplementary Data 10 for underlying data.
Fig. 3
Fig. 3. Severe variants and the genotype–phenotype correlations in CTCF.
a LGD (red) and MIS30 (blue) variants are depicted against a protein model for CTCF. Variants new to this study are shown above the protein while published DNMs from denovo-db (v1.5) are below. Variants are flagged with yellow lightning bolt if de novo. Annotated protein domains are shown (colored blocks) for the largest protein isoforms. b Heatmap depicts the common clinical features for patients carrying CTCF severe variants by using the specific HPO annotation (rows), which were retrieved from published studies and our cohort (columns). Phenotypic enrichment is shown according to the features’ recurrence labeled by the increment of color degree. The items with no data available were labeled with “-” and were excluded in the frequency analysis.
Fig. 4
Fig. 4. Distribution of severe patient variants in six genes.
Protein diagrams are shown for HNRNPU (a) KCNQ3 (b) ZBTB18 (c) TCF12 (d) SPEN (e), and LEO1 (f) with the same display metrics that applied in Fig. 3. Validated LGD (red) and MIS30 (blue) variants are plotted. Variants listed above the protein model are new to this study, while the ones below were published previously. Paternal (black arrow) and maternal (green arrow) inheritance are shown if determined. A yellow lightning bolt denotes a de novo mutation.

References

    1. First MB. Diagnostic and statistical manual of mental disorders, 5th edition, and clinical utility. J. Nerv. Ment. Dis. 2013;201:727–729. doi: 10.1097/NMD.0b013e3182a2168a. - DOI - PubMed
    1. O’Roak BJ, et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science. 2012;338:1619–1622. doi: 10.1126/science.1227764. - DOI - PMC - PubMed
    1. Stessman HA, et al. Targeted sequencing identifies 91 neurodevelopmental-disorder risk genes with autism and developmental-disability biases. Nat. Genet. 2017;49:515–526. doi: 10.1038/ng.3792. - DOI - PMC - PubMed
    1. Wang T, et al. De novo genic mutations among a Chinese autism spectrum disorder cohort. Nat. Commun. 2016;7:13316. doi: 10.1038/ncomms13316. - DOI - PMC - PubMed
    1. Guo H, et al. Inherited and multiple de novo mutations in autism/developmental delay risk genes suggest a multifactorial model. Mol. Autism. 2018;9:64. doi: 10.1186/s13229-018-0247-z. - DOI - PMC - PubMed

Publication types

MeSH terms