Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec 2:4:31.
doi: 10.1038/s41525-019-0106-7. eCollection 2019.

Re-annotation of 191 developmental and epileptic encephalopathy-associated genes unmasks de novo variants in SCN1A

Affiliations

Re-annotation of 191 developmental and epileptic encephalopathy-associated genes unmasks de novo variants in SCN1A

Charles A Steward et al. NPJ Genom Med. .

Abstract

The developmental and epileptic encephalopathies (DEE) are a group of rare, severe neurodevelopmental disorders, where even the most thorough sequencing studies leave 60-65% of patients without a molecular diagnosis. Here, we explore the incompleteness of transcript models used for exome and genome analysis as one potential explanation for a lack of current diagnoses. Therefore, we have updated the GENCODE gene annotation for 191 epilepsy-associated genes, using human brain-derived transcriptomic libraries and other data to build 3,550 putative transcript models. Our annotations increase the transcriptional 'footprint' of these genes by over 674 kb. Using SCN1A as a case study, due to its close phenotype/genotype correlation with Dravet syndrome, we screened 122 people with Dravet syndrome or a similar phenotype with a panel of exon sequences representing eight established genes and identified two de novo SCN1A variants that now - through improved gene annotation - are ascribed to residing among our exons. These two (from 122 screened people, 1.6%) molecular diagnoses carry significant clinical implications. Furthermore, we identified a previously classified SCN1A intronic Dravet syndrome-associated variant that now lies within a deeply conserved exon. Our findings illustrate the potential gains of thorough gene annotation in improving diagnostic yields for genetic disorders.

Keywords: Medical genomics; Molecular medicine.

PubMed Disclaimer

Conflict of interest statement

Competing interestsC.A.S., A.S.R. and N.J.L. are employed by Congenica Ltd. P.F. is a member of the scientific advisory boards for Fabric Genomics, Inc., and Eagle Genomics, Ltd. S.P. is an employee and D.V. is a postdoc of AstraZeneca. The remaining authors declare they have no competing interests.

Figures

Fig. 1
Fig. 1
Expression of transcripts. Cumulative distribution curves for the number of intron-supporting reads in pre-existing (GENCODE v20) versus updated (GENCODE v28) annotation. Distribution curves for overall transcripts, CDS, 5′ and 3′ UTR are given. The x-axis is in log10 scale.
Fig. 2
Fig. 2
Variants in coding sequence of CDKL5. ENST00000379996 had previously been annotated in GENCODE and represents a known protein-coding transcript; coding exons 17–20 are shown (coding exons in black; UTR in grey). ENST00000623535 was annotated as part of this study and the transcript contains an alternative CDS based on the usage of a different C-terminus, linked to a 3′ UTR sequence that extends into an intron of ENST00000379996. This alternative 3′ UTR has strong support in polyAseq experiments and RNA-Seq assays across multiple tissues (not shown). The 170 bp of CDS added to the intronic region contains 4 ClinVar variants, listed here by their dbSNP I.D. alongside the consequences as presented by ClinVar. Bodian et al. recently reported rs1555955290 as a de novo frameshift variant in a child with early onset seizures. Variants rs1555955296 and rs863225289 are de novo nonsense variants submitted to ClinVar by private testing laboratories, both from people described as having early infantile epileptic encephalopathy 2. In contrast, nonsense variant rs1555955268 is currently classified as ‘likely benign’ by ClinVar; this is a privately submitted germline variant from an individual with an unspecified condition. Additional transcript models within the gene have been omitted for clarity.
Fig. 3
Fig. 3
The updated SCN1A annotation identified 10 exons and five shifted splice junctions, increasing the genomic footprint of SCN1A transcription by ~3 kb. All features are described with respect to existing Ensembl model ENST00000303395 and numbered according to the scheme used in Table 1. For clarity, the features are shown as truncated models containing only the exons of specific interest (and certain features are present on multiple transcript models in the complete gene annotation). UTR sequences are shown in grey, coding or NMD regions in black. Features [1] and [2] represent previously unreported 5′ UTR sequences that have conservation and equivalent expression in mouse and chicken. Features [7] and [14] are cassette exons predicted to invoke NMD and contain the de novo variants identified in the study within patients one and two respectively. Feature 9 is a cassette exon that is an ancient duplication of coding exon five, to which it is transcribed in a mutually exclusive manner; the clinical significance of this exon has been previously demonstrated by Tate et al. Feature 12 is a cassette exon predicted to invoke NMD. Intron and exon sizes are to approximate scale. Additional transcript models have been omitted for clarity.
Fig. 4
Fig. 4
Variants in coding regions are associated with DS. a Pedigrees and Sanger sequencing traces of the two families with a de novo SCN1A variant in the identified poison exons. b The two transcripts containing the variants, relative to the full-length transcript. Red exons are coding, white exons are non-coding. c Variants are predicted to disrupt a hnRNP A1 recognition site.

References

    1. Nieh SE, Sherr EH. Epileptic encephalopathies: new genes and new pathways. Neurotherapeutics. 2014;11:796–806. - PMC - PubMed
    1. EuroEpinomics-R.E.S.Consortium; Epilepsy Phenome/Genome-Project & Epi4K.Consortium. De novo mutations in synaptic transmission genes including DNM1 cause epileptic encephalopathies. Am. J. Hum. Genet.95, 360–370 (2014). - PMC - PubMed
    1. Djemie T, et al. Pitfalls in genetic testing: the story of missed SCN1A mutations. Mol. Genet. Genom. Med. 2016;4:457–464. - PMC - PubMed
    1. Deciphering-Developmental-Disorders-Study. Prevalence and architecture of de novo mutations in developmental disorders. Nature. 2017;542:433–438. - PMC - PubMed
    1. Mark, C. et al. The 100,000 Genomes Project Protocol (2017).