This is a preprint.
An assessment of the genomic structural variation landscape in Sub-Saharan African populations
- PMID: 39041024
- PMCID: PMC11261963
- DOI: 10.21203/rs.3.rs-4485126/v1
An assessment of the genomic structural variation landscape in Sub-Saharan African populations
Abstract
Structural variants are responsible for a large part of genomic variation between individuals and play a role in both common and rare diseases. Databases cataloguing structural variants notably do not represent the full spectrum of global diversity, particularly missing information from most African populations. To address this representation gap, we analysed 1,091 high-coverage African genomes, 545 of which are public data sets, and 546 which have been analysed for structural variants for the first time. Variants were called using five different tools and datasets merged and jointly called using SURVIVOR. We identified 67,795 structural variants throughout the genome, with 10,421 genes having at least one variant. Using a conservative overlap in merged data, 6,414 of the structural variants (9.5%) are novel compared to the Database of Genomic Variants. This study contributes to knowledge of the landscape of structural variant diversity in Africa and presents a reliable dataset for potential applications in population genetics and health-related research.
Keywords: African diversity; Structural variants; copy number variants; genomic variation.
Conflict of interest statement
Additional Declarations: There is NO Competing Interest.
Figures






Similar articles
-
The Extent and Impact of Variation in ADME Genes in Sub-Saharan African Populations.Front Pharmacol. 2021 Apr 28;12:634016. doi: 10.3389/fphar.2021.634016. eCollection 2021. Front Pharmacol. 2021. PMID: 34721006 Free PMC article.
-
The landscape of genomic structural variation in Indigenous Australians.Nature. 2023 Dec;624(7992):602-610. doi: 10.1038/s41586-023-06842-7. Epub 2023 Dec 13. Nature. 2023. PMID: 38093003 Free PMC article.
-
Comparison of sequencing data processing pipelines and application to underrepresented African human populations.BMC Bioinformatics. 2021 Oct 9;22(1):488. doi: 10.1186/s12859-021-04407-x. BMC Bioinformatics. 2021. PMID: 34627144 Free PMC article.
-
Geographic distribution and adaptive significance of genomic structural variants: an anthropological genetics perspective.Hum Biol. 2014 Fall;86(4):260-75. doi: 10.13110/humanbiology.86.4.0260. Hum Biol. 2014. PMID: 25959693 Review.
-
Systematic Review of Genetic Factors in the Etiology of Esophageal Squamous Cell Carcinoma in African Populations.Front Genet. 2019 Aug 2;10:642. doi: 10.3389/fgene.2019.00642. eCollection 2019. Front Genet. 2019. PMID: 31428123 Free PMC article.
References
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous