Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec 9;89(6):731-44.
doi: 10.1016/j.ajhg.2011.11.010.

Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia

Affiliations

Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia

Mait Metspalu et al. Am J Hum Genet. .

Erratum in

  • Am J Hum Genet. 2012 Feb 10;90(2):378-9

Abstract

South Asia harbors one of the highest levels genetic diversity in Eurasia, which could be interpreted as a result of its long-term large effective population size and of admixture during its complex demographic history. In contrast to Pakistani populations, populations of Indian origin have been underrepresented in previous genomic scans of positive selection and population structure. Here we report data for more than 600,000 SNP markers genotyped in 142 samples from 30 ethnic groups in India. Combining our results with other available genome-wide data, we show that Indian populations are characterized by two major ancestry components, one of which is spread at comparable frequency and haplotype diversity in populations of South and West Asia and the Caucasus. The second component is more restricted to South Asia and accounts for more than 50% of the ancestry in Indian populations. Haplotype diversity associated with these South Asian ancestry components is significantly higher than that of the components dominating the West Eurasian ancestry palette. Modeling of the observed haplotype diversities suggests that both Indian ancestry components are older than the purported Indo-Aryan invasion 3,500 YBP. Consistent with the results of pairwise genetic distances among world regions, Indians share more ancestry signals with West than with East Eurasians. However, compared to Pakistani populations, a higher proportion of their genes show regionally specific signals of high haplotype homozygosity. Among such candidates of positive selection in India are MSTN and DOK5, both of which have potential implications in lipid metabolism and the etiology of type 2 diabetes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Matrix of Pairwise Mean FST Values of Regional Groupings of the Studied Populations Average of intergroup FST values (where the regional group is composed of multiple populations) is given in the diagonal. Central India is itself a composite of two regional groupings of samples from different populations that makes the negative intergroup FST uninformative.
Figure 2
Figure 2
Genome-Wide Structure of the Studied Populations Revealed by 530,000 SNPs (A) principal component analysis of the Eurasian populations. The following abbreviations are used: IE, Indo European speakers; DR, Dravidic speakers; AA, Austroasiatic speakers; TB, Tibeto Burman speakers; , data from Hapmap. (B) ADMIXTURE analysis at K = 8 and 12. The following symbols are used: , contains one Dhurwa; ∗∗, contains one Lambadi; 1, Rajasthan; 2, Chattisgarh and Jharkhand; 3, Chattisgarh, Orissa, and Madhya Pradesh. A.P., Andhra Pradesh; Kar, Karnataka; Ker, Kerala; T. Nadu, Tamil Nadu; #, Nihali language isolate speakers from Maharasthra; §, Tibeto Burman speakers from east Indian states Meghalaya and Nagaland; AA, Austroasiatic languages.
Figure 3
Figure 3
Sharing Signals for Selection between Continental Populations (A) iHS signal sharing between continental populations. The fraction of signals found in the top 1% of iHS scores in population i and the top 5% of population j is given in cell (i,j). Africa refers to Yoruba, Mandenka, and Bantu individuals from the HGDP-CEPH panel. (B) XP-EHH signal sharing between continental populations. The fraction of signals found in the top 1% of XP-EHH scores in population i and the top 5% of population j is given in cell (i,j). Africa refers to Yoruba, Mandenka, and Bantu individuals from the HGDP-CEPH panel.

References

    1. Xing J., Watkins W.S., Hu Y., Huff C.D., Sabo A., Muzny D.M., Bamshad M.J., Gibbs R.A., Jorde L.B., Yu F. Genetic diversity in India and the inference of Eurasian population expansion. Genome Biol. 2010;11:R113. - PMC - PubMed
    1. Atkinson Q.D., Gray R.D., Drummond A.J. mtDNA variation predicts population size in humans and reveals a major Southern Asian chapter in human prehistory. Mol. Biol. Evol. 2008;25:468–474. - PubMed
    1. Kivisild T., Bamshad M.J., Kaldma K., Metspalu M., Metspalu E., Reidla M., Laos S., Parik J., Watkins W.S., Dixon M.E., et al. Deep common ancestry of indian and western-Eurasian mitochondrial DNA lineages. Curr. Biol. 1999;9:1331–1334. - PubMed
    1. Kivisild T., Kaldma K., Metspalu M., Parik J., Papiha S.S., Villems R. In: Genomic diversity. Papiha S.S., Deka R., Chakraborty R., editors. Kluwer Academic/Plenum Publishers; 1999. The place of the Indian mitochondrial DNA variants in the global network of maternal lineages and the peopling of the Old World; pp. 135–152.
    1. Metspalu M., Kivisild T., Metspalu E., Parik J., Hudjashov G., Kaldma K., Serk P., Karmin M., Behar D.M., Gilbert M.T., et al. Most of the extant mtDNA boundaries in south and southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans. BMC Genet. 2004;5:26. - PMC - PubMed

Publication types