Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
- PMID: 37365340
- PMCID: PMC10406601
- DOI: 10.1038/s41592-023-01914-y
Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes
Abstract
Advancements in sequencing technologies and assembly methods enable the regular production of high-quality genome assemblies characterizing complex regions. However, challenges remain in efficiently interpreting variation at various scales, from smaller tandem repeats to megabase rearrangements, across many human genomes. We present a PanGenome Research Tool Kit (PGR-TK) enabling analyses of complex pangenome structural and haplotype variation at multiple scales. We apply the graph decomposition methods in PGR-TK to the class II major histocompatibility complex demonstrating the importance of the human pangenome for analyzing complicated regions. Moreover, we investigate the Y-chromosome genes, DAZ1/DAZ2/DAZ3/DAZ4, of which structural variants have been linked to male infertility, and X-chromosome genes OPN1LW and OPN1MW linked to eye disorders. We further showcase PGR-TK across 395 complex repetitive medically important genes. This highlights the power of PGR-TK to resolve complex variation in regions of the genome that were previously too complex to analyze.
© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.
Conflict of interest statement
C.-S.C. is an employee and shareholder of GeneDX. F.J.S. obtains research support from Illumina, PacBio and Oxford Nanopore. The remaining authors declare no competing interests.
Figures




References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials