Copy number variation in human genomes from three major ethno-linguistic groups in Africa
- PMID: 32272904
- PMCID: PMC7147055
- DOI: 10.1186/s12864-020-6669-y
Copy number variation in human genomes from three major ethno-linguistic groups in Africa
Abstract
Background: Copy number variation is an important class of genomic variation that has been reported in 75% of the human genome. However, it is underreported in African populations. Copy number variants (CNVs) could have important impacts on disease susceptibility and environmental adaptation. To describe CNVs and their possible impacts in Africans, we sequenced genomes of 232 individuals from three major African ethno-linguistic groups: (1) Niger Congo A from Guinea and Côte d'Ivoire, (2) Niger Congo B from Uganda and the Democratic Republic of Congo and (3) Nilo-Saharans from Uganda. We used GenomeSTRiP and cn.MOPS to identify copy number variant regions (CNVRs).
Results: We detected 7608 CNVRs, of which 2172 were only deletions, 2384 were only insertions and 3052 had both. We detected 224 previously un-described CNVRs. The majority of novel CNVRs were present at low frequency and were not shared between populations. We tested for evidence of selection associated with CNVs and also for population structure. Signatures of selection identified previously, using SNPs from the same populations, were overrepresented in CNVRs. When CNVs were tagged with SNP haplotypes to identify SNPs that could predict the presence of CNVs, we identified haplotypes tagging 3096 CNVRs, 372 CNVRs had SNPs with evidence of selection (iHS > 3) and 222 CNVRs had both. This was more than expected (p < 0.0001) and included loci where CNVs have previously been associated with HIV, Rhesus D and preeclampsia. When integrated with 1000 Genomes CNV data, we replicated their observation of population stratification by continent but no clustering by populations within Africa, despite inclusion of Nilo-Saharans and Niger-Congo populations within our dataset.
Conclusions: Novel CNVRs in the current study increase representation of African diversity in the database of genomic variants. Over-representation of CNVRs in SNP signatures of selection and an excess of SNPs that both tag CNVs and are subject to selection show that CNVs may be the actual targets of selection at some loci. However, unlike SNPs, CNVs alone do not resolve African ethno-linguistic groups. Tag haplotypes for CNVs identified may be useful in predicting African CNVs in future studies where only SNP data is available.
Keywords: Adaptation; CNV; Niger Congo A; Niger Congo B; Nilo-Saharan; Signatures of selection; Structural variation; Tag haplotypes.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures






Similar articles
-
High Levels of Genetic Diversity within Nilo-Saharan Populations: Implications for Human Adaptation.Am J Hum Genet. 2020 Sep 3;107(3):473-486. doi: 10.1016/j.ajhg.2020.07.007. Epub 2020 Aug 10. Am J Hum Genet. 2020. PMID: 32781046 Free PMC article.
-
Copy number variations in the genome of the Qatari population.BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5. BMC Genomics. 2015. PMID: 26490036 Free PMC article.
-
Inter- and intra-breed genome-wide copy number diversity in a large cohort of European equine breeds.BMC Genomics. 2019 Oct 22;20(1):759. doi: 10.1186/s12864-019-6141-z. BMC Genomics. 2019. PMID: 31640551 Free PMC article.
-
African genetic diversity provides novel insights into evolutionary history and local adaptations.Hum Mol Genet. 2018 Aug 1;27(R2):R209-R218. doi: 10.1093/hmg/ddy161. Hum Mol Genet. 2018. PMID: 29741686 Free PMC article. Review.
-
Genome wide copy number variations using Porcine 60K SNP Beadchip in Landlly pigs.Anim Biotechnol. 2023 Nov;34(6):1891-1899. doi: 10.1080/10495398.2022.2056047. Epub 2022 Apr 4. Anim Biotechnol. 2023. PMID: 35369845 Review.
Cited by
-
Genome-wide analysis of copy number variants and normal facial variation in a large cohort of Bantu Africans.HGG Adv. 2021 Dec 24;3(1):100082. doi: 10.1016/j.xhgg.2021.100082. eCollection 2022 Jan 13. HGG Adv. 2021. PMID: 35047866 Free PMC article.
-
Incorporating CNV analysis improves the yield of exome sequencing for rare monogenic disorders-an important consideration for resource-constrained settings.Front Genet. 2023 Dec 14;14:1277784. doi: 10.3389/fgene.2023.1277784. eCollection 2023. Front Genet. 2023. PMID: 38155715 Free PMC article. Review.
-
An assessment of the genomic structural variation landscape in Sub-Saharan African populations.Res Sq [Preprint]. 2024 Jul 8:rs.3.rs-4485126. doi: 10.21203/rs.3.rs-4485126/v1. Res Sq. 2024. PMID: 39041024 Free PMC article. Preprint.
-
Genomewide Association Study Identifies Copy Number Variants Associated With Warfarin Dose Response and Risk of Venous Thromboembolism in African Americans.Clin Pharmacol Ther. 2023 Mar;113(3):624-633. doi: 10.1002/cpt.2820. Epub 2023 Jan 19. Clin Pharmacol Ther. 2023. PMID: 36507737 Free PMC article.
-
Genome-wide copy number variations in a large cohort of bantu African children.BMC Med Genomics. 2021 May 17;14(1):129. doi: 10.1186/s12920-021-00978-z. BMC Med Genomics. 2021. PMID: 34001112 Free PMC article.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources