Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 6;5(1):670.
doi: 10.1038/s42003-022-03610-7.

Comprehensive short and long read sequencing analysis for the Gaucher and Parkinson's disease-associated GBA gene

Affiliations

Comprehensive short and long read sequencing analysis for the Gaucher and Parkinson's disease-associated GBA gene

Marco Toffoli et al. Commun Biol. .

Abstract

GBA variants carriers are at increased risk of Parkinson's disease (PD) and Lewy body dementia (LBD). The presence of pseudogene GBAP1 predisposes to structural variants, complicating genetic analysis. We present two methods to resolve recombinant alleles and other variants in GBA: Gauchian, a tool for short-read, whole-genome sequencing data analysis, and Oxford Nanopore sequencing after PCR enrichment. Both methods were concordant for 42 samples carrying a range of recombinants and GBAP1-related mutations, and Gauchian outperformed the GATK Best Practices pipeline. Applying Gauchian to sequencing of over 10,000 individuals shows that copy number variants (CNVs) spanning GBAP1 are relatively common in Africans. CNV frequencies in PD and LBD are similar to controls. Gains may coexist with other mutations in patients, and a modifying effect cannot be excluded. Gauchian detects more GBA variants in LBD than PD, especially severe ones. These findings highlight the importance of accurate GBA analysis in these patients.

PubMed Disclaimer

Conflict of interest statement

XC and MAE were employees of Illumina Inc. during the research and writing of this manuscript and are now employees of Pacific Biosciences Inc. SWS serve on the Scientific Advisory Council of the Lewy Body Dementia Association. SWS is an editorial board member for the Journal of Parkinson’s Disease and JAMA Neurology. AHVS is supported by the UCLH NIHR BRC. MT, FJS, C-YL, SM, AH, SK, MEG-S, ES and CP declare no competing interests.

Figures

Fig. 1
Fig. 1. Schematic illustration of the different types of GBA recombinant alleles and positions of PCR primers used to detect them with ONT.
Not to scale, corresponding roughly to g.chr1:155,210,000-155,245,000. a Wild-type allele. Only primer pair 1 will produce an amplicon. b Non-reciprocal recombination (gene conversion). Similar to non-recombinant alleles, only primer pair 1 will produce an amplicon. c Reciprocal crossover between gene and pseudogene resulting in a 20.6 kb deletion (CNL). Only primer pair 2 will produce an amplicon. d Reciprocal crossover between gene and pseudogene resulting in a 20.6 kb duplication (CNG). Both primer pair 1 and primer pair 3 will produce amplicons. Note that the normal allele is present and that amplification with primer pair 3 will produce an amplicon independently of the number of copy number gains. CNG copy number gain, CNL copy number loss.
Fig. 2
Fig. 2. Gauchian detects challenging GBA variants through targeted copy number calling and haplotype phasing.
a Median mapping quality (red line) across 2504 1kGP samples plotted for each position in the GBA/GBAP1 region (hg38). A median filter is applied in a 50 bp window. The eleven exons of GBA are shown as orange boxes. GBAP1 and MTX1 exons are shown as green and purple boxes, respectively. The 4 kb major homology region (98.1% sequence similarity, exons 9–11) between GBA and GBAP1 is shaded in pink and highlights an area of low mapping accuracy. The light blue box shows the 10 kb unique region between the two genes in which copy number calling is performed in Gauchian. b Distribution of normalised depth in the 10 kb CN calling region in 2504 1kGP samples, showing peaks at CN1 (CNL), 2 (no CNV) and 3-8 (CNG). c Recombinant haplotypes in the exons 9–11 homology region, distinguished by GBA/GBAP1 differentiating bases (x-axis). Reference genome sequences are shaded in yellow. There is an error in hg38 where the first three sites of GBAP1 show GBA bases, which could lead to alignment errors. The GBA recombinant haplotypes are shown in the white background, including those where one or a few nearby sites are mutated to the corresponding GBAP1 base, resulting from either gene conversion or CNL. Grey bases indicate that the base can be either GBA or GBAP1 depending on the breakpoint position of the CNL/conversion. Shaded in purple are two example GBAP1 haplotypes, found by Gauchian, that have been partially converted to GBA and can cause false-positive GBA variant calls by standard secondary analysis pipelines. For the first example, the reverse-p.L483P variant on GBAP1 directs aligners to align GBAP1 reads to GBA, causing the nearby p.A495P false-positive call. For the second example, the reverse-c.1263del variant inserts 55 bp to GBAP1, driving GBAP1 reads to align to GBA, causing the nearby p.D448H false-positive call. CNG copy number gain, CNL copy number loss, CNV copy number variant.

References

    1. Hruska, K. S., LaMarca, M. E., Scott, C. R. & Sidransky, E. Gaucher disease: mutation and polymorphism spectrum in the glucocerebrosidase gene (GBA). Hum. Mutat.29, 567–583 (2008). - PubMed
    1. Do, J., McKinney, C., Sharma, P. & Sidransky, E. Glucocerebrosidase and its relevance to Parkinson disease. Mol. Neurodegeneration14, 36 (2019). - PMC - PubMed
    1. Sidransky, E. et al. Multicenter analysis of glucocerebrosidase mutations in Parkinson’s disease. N. Engl. J. Med.361, 1651–1661 (2009). - PMC - PubMed
    1. Guerreiro, R. et al. Investigating the genetic architecture of dementia with Lewy bodies: a two-stage genome-wide association study. Lancet Neurol.17, 64–74 (2018). - PMC - PubMed
    1. Horowitz, M. et al. The human glucocerebrosidase gene and pseudogene: structure and evolution. Genomics4, 87–96 (1989). - PubMed

Publication types

Substances