Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar;639(8053):132-142.
doi: 10.1038/s41586-024-08531-5. Epub 2025 Feb 5.

The genetic origin of the Indo-Europeans

Iosif Lazaridis #  1   2 Nick Patterson #  3   4 David Anthony #  5   6 Leonid Vyazov #  7   8 Romain Fournier  9 Harald Ringbauer  10   11 Iñigo Olalde  10   12   13 Alexander A Khokhlov  14 Egor P Kitov  15 Natalia I Shishlina  16 Sorin C Ailincăi  17 Danila S Agapov  18 Sergey A Agapov  18 Elena Batieva  19 Baitanayev Bauyrzhan  20 Zsolt Bereczki  21 Alexandra Buzhilova  22 Piya Changmai  23 Andrey A Chizhevsky  24 Ion Ciobanu  25 Mihai Constantinescu  26 Marietta Csányi  27 János Dani  28   29 Peter K Dashkovskiy  30 Sándor Évinger  31 Anatoly Faifert  32 Pavel Flegontov  10   23   33 Alin Frînculeasa  34 Mădălina N Frînculeasa  35 Tamás Hajdu  36 Tom Higham  37   38 Paweł Jarosz  39 Pavol Jelínek  40 Valeri I Khartanovich  41 Eduard N Kirginekov  42 Viktória Kiss  43 Alexandera Kitova  44 Alexeiy V Kiyashko  45 Jovan Koledin  46 Arkady Korolev  14 Pavel Kosintsev  47   48 Gabriella Kulcsár  43 Pavel Kuznetsov  14 Rabadan Magomedov  49 Aslan M Mamedov  20 Eszter Melis  43 Vyacheslav Moiseyev  41 Erika Molnár  21 Janet Monge  50 Octav Negrea  34 Nadezhda A Nikolaeva  51 Mario Novak  52   53 Maria Ochir-Goryaeva  54 György Pálfi  21 Sergiu Popovici  55 Marina P Rykun  56 Tatyana M Savenkova  57 Vladimir P Semibratov  58 Nikolai N Seregin  59 Alena Šefčáková  60 Raikhan S Mussayeva  20 Irina Shingiray  61 Vladimir N Shirokov  62 Angela Simalcsik  25   63 Kendra Sirak  10   64 Konstantin N Solodovnikov  65 Judit Tárnoki  27 Alexey A Tishkin  58 Viktor Trifonov  66   67 Sergey Vasilyev  68 Ali Akbari  10   64 Esther S Brielle  10 Kim Callan  64   69 Francesca Candilio  70 Olivia Cheronet  37   38 Elizabeth Curtis  64   69 Olga Flegontova  23   33 Lora Iliev  64   69 Aisling Kearns  64 Denise Keating  71 Ann Marie Lawson  64   69 Matthew Mah  64   72   69 Adam Micco  64 Megan Michel  10   64   11   69 Jonas Oppenheimer  64   69 Lijun Qiu  64   69 J Noah Workman  64   69 Fatma Zalzala  64   69 Anna Szécsényi-Nagy  73 Pier Francesco Palamara  9   74 Swapan Mallick  64   72   69 Nadin Rohland  64 Ron Pinhasi  75   76 David Reich  77   78   79   80
Affiliations

The genetic origin of the Indo-Europeans

Iosif Lazaridis et al. Nature. 2025 Mar.

Erratum in

  • Author Correction: The genetic origin of the Indo-Europeans.
    Lazaridis I, Patterson N, Anthony D, Vyazov L, Fournier R, Ringbauer H, Olalde I, Khokhlov AA, Kitov EP, Shishlina NI, Ailincăi SC, Agapov DS, Agapov SA, Batieva E, Bauyrzhan B, Bereczki Z, Buzhilova A, Changmai P, Chizhevsky AA, Ciobanu I, Constantinescu M, Csányi M, Dani J, Dashkovskiy PK, Évinger S, Faifert A, Flegontov P, Frînculeasa A, Frînculeasa MN, Hajdu T, Higham T, Jarosz P, Jelínek P, Khartanovich VI, Kirginekov EN, Kiss V, Kitova A, Kiyashko AV, Koledin J, Korolev A, Kosintsev P, Kulcsár G, Kuznetsov P, Magomedov R, Mamedov AM, Melis E, Moiseyev V, Molnár E, Monge J, Negrea O, Nikolaeva NA, Novak M, Ochir-Goryaeva M, Pálfi G, Popovici S, Rykun MP, Savenkova TM, Semibratov VP, Seregin NN, Šefčáková A, Mussayeva RS, Shingiray I, Shirokov VN, Simalcsik A, Sirak K, Solodovnikov KN, Tárnoki J, Tishkin AA, Trifonov V, Vasilyev S, Akbari A, Brielle ES, Callan K, Candilio F, Cheronet O, Curtis E, Flegontova O, Iliev L, Kearns A, Keating D, Lawson AM, Mah M, Micco A, Michel M, Oppenheimer J, Qiu L, Workman JN, Zalzala F, Szécsényi-Nagy A, Palamara PF, Mallick S, Rohland N, Pinhasi R, Reich D. Lazaridis I, et al. Nature. 2025 Mar;639(8054):E14. doi: 10.1038/s41586-025-08789-3. Nature. 2025. PMID: 39979458 No abstract available.

Abstract

The Yamnaya archaeological complex appeared around 3300 BC across the steppes north of the Black and Caspian Seas, and by 3000 BC it reached its maximal extent, ranging from Hungary in the west to Kazakhstan in the east. To localize Yamnaya origins among the preceding Eneolithic people, we assembled ancient DNA from 435 individuals, demonstrating three genetic clines. A Caucasus-lower Volga (CLV) cline suffused with Caucasus hunter-gatherer1 ancestry extended between a Caucasus Neolithic southern end and a northern end at Berezhnovka along the lower Volga river. Bidirectional gene flow created intermediate populations, such as the north Caucasus Maikop people, and those at Remontnoye on the steppe. The Volga cline was formed as CLV people mixed with upriver populations of Eastern hunter-gatherer2 ancestry, creating hypervariable groups, including one at Khvalynsk. The Dnipro cline was formed when CLV people moved west, mixing with people with Ukraine Neolithic hunter-gatherer ancestry3 along the Dnipro and Don rivers to establish Serednii Stih groups, from whom Yamnaya ancestors formed around 4000 BC and grew rapidly after 3750-3350 BC. The CLV people contributed around four-fifths of the ancestry of the Yamnaya and, entering Anatolia, probably from the east, at least one-tenth of the ancestry of Bronze Age central Anatolians, who spoke Hittite4,5. We therefore propose that the final unity of the speakers of 'proto-Indo-Anatolian', the language ancestral to both Anatolian and Indo-European people, occurred in CLV people some time between 4400 BC and 4000 BC.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Extended Data Figure 1:
Extended Data Figure 1:. The origin of Central Anatolian Bronze Age people.
(a) Models with eastern steppe sources (including CLV and Serednii Stih). Fitting models include Mesopotamian (Çayönü) and steppe ancestry. (b) Models with western sources, including Usatove and those from Southeastern Europe fail except those with Çayönü and either Mayaky or Boyanovo EBA (both of which are Yamnaya-derived). (c) The steppe (BPgroup)+Çayönü model fails all Chalcolithic/Bronze Anatolians except Central Anatolian Bronze Age. (d) Steppe (BPgroup) ancestry in the BPgroup+Çayönü model is observed in all individuals of the Central Anatolian Bronze Age (mean and ±3 s.e. estimated by qpAdm are shown for all Chalcolithic and Bronze Age individuals from Anatolia that fit the model at p>0.05) as well as in individual ART027_d from Chalcolithic Arslantepe in Eastern Anatolia. (e) BPgroup-related ancestry admixed with different substrata: Aknashen-related in the North Caucasus Maikop, Masis Blur-related in Chalcolithic Armenia, and Mesopotamian-related (Çayönü) in the ancestors of the Central Anatolian Bronze Age, following the route (f) from the North Caucasus to Anatolia; sites with BPgroup-related ancestry marked in bold. In all panels p-values estimated by qpWave are shown.
Extended Data Figure 2:
Extended Data Figure 2:. Admixture date estimates.
We estimate admixture dates for the Core Yamnaya as a mixture of European hunter-gatherer and West Asian populations (a), for the Don Yamnaya as a mixture of Core Yamnaya and UNHG (b), for the Bulgaria, Moldova, Romania, and Serbia (BMRS) Yamnaya as a mixture of Core Yamnaya and European Neolithic/Chalcolithic farmers (c), for the Corded Ware as a mixture of Core Yamnaya and Globula Amphora (d), and for a combined Caucasus-Anatolia population (Maikop-Armenia_C-TUR_C_BA) a mixture of European hunter-gatherer and West Asian populations which occurred ca. 4400 BCE (e). The Core Yamnaya were formed ca. 4000 BCE, followed by admixture ca. 3350 BCE with UNHG and European farmers in the east and west of the Dnipro-Don region and <3000 BCE in central-eastern Europe.
Extended Data Figure 3:
Extended Data Figure 3:. Population structure in people with a Yamnaya cultural affiliation.
Individuals are projected in the same space as in Fig. 1. (a) showing that the Core Yamnaya cluster (red fill symbols) from diverse sites is differentiated from the Don Yamnaya (blue fill) who tend towards the UNHG. (b) Yamnaya individuals in the West (Ukraine, Hungary, Slovakia, and Southeastern Europe) include a tight cluster of individuals as well as others that tend towards the direction of European Neolithic and Chalcolithic groups from Romania and Hungary. Individuals from Russia are shown in grey circles in panel (b). Coordinates of plotted points can be found in Online Table 6.
Extended Data Figure 4:
Extended Data Figure 4:
A 4-way model for the entire Dnipro-Don-Volga-Caucasus region. Mean and ±1 standard error estimated by qpAdm is shown.
Extended Data Figure 5:
Extended Data Figure 5:. The origin of Indo-Anatolian and Indo-European languages.
Genetic reconstruction of the ancestry of Pontic-Caspian steppe and West Asian populations points to the North Caucasus-Lower Volga area as the homeland of Indo-Anatolian languages and to the Serednii Stih archaeological culture of the Dnipro-Don area as the homeland of Indo-European languages. The Caucasus-Lower Volga people had diverse distal roots, estimated using the qpAdm software on the left barplot, as Caucasus hunter-gatherer (purple), Central Asian (red), Eastern hunter-gatherer (pink), and West Asian Neolithic (green). Caucasus-Lower Volga expansions, estimated using qpAdm on the right barplot, disseminated Caucasus Neolithic (blue)-Lower Volga Eneolithic (orange) proximal ancestries, mixing with the inhabitants of the North Pontic region (yellow), Volga region (yellow), and West Asia (green).
Figure 1:
Figure 1:. Three Eneolithic clines and their neighbors in space and time.
(a) Map with analyzed sites. (b) PCA analysis using axes formed by a set of ancient West European hunter-gatherer (WHG), Siberian, West Asian, and European farmer populations. Selected individuals relevant to this study are projected (Methods). (c) qpAdm models fitted on individuals of the populations of the clines. The Volga Cline is generated by admixture between Lower Volga (BPgroup) people with upriver Eastern hunter-gatherers (EHG). People of the Dnipro Cline have UNHG or UNHG+EHG admixture relative to the Core Yamnaya (the hunter-gatherer source along this cline is significantly variable). The Caucasus-Lower Volga Cline is generated by admixture of lower Volga people with those from the Neolithic Caucasus (Aknashen-related).
Figure 2.
Figure 2.. The three clines in the context of Eneolithic and Bronze Age admixture.
Six 3-source qpAdm models elucidate a complex history of admixture. (a) Caucasus and European hunter-gatherer admixtures in the “Old Steppe”: Krivyansky on the Lower Don received much more CHG-related admixture than upriver people of the Middle Don at Golubaya Krinitsa. In the Middle and Upper Volga and the Kama River, populations had negligible CHG-related influence. (b) The “Don-Volga” difference. On the Lower Volga and North Caucasus piedmont, the BPgroup received CHG-related ancestry like its western Lower Don counterpart at Krivyansky; but, it also received ancestry from Central Asia; this eastern influence was higher still in the Bronze Age Steppe Maikop. (c) The Volga basin Eneolithic populations vis-à-vis the Don: populations at Khvalynsk, Klopkov Bugor, and Ekaterinovka form a Volga Cline between the Berezhnvoka cluster on the Lower Volga and the upriver EHG-like populations of the Middle Volga (Labazy and Lebyazhinka). (d) the Volga basin Eneolithic populations vis-à-vis Central Asia: a slight excess of Central Asian ancestry in the Khi subset of Khvalynsk. (e) the “Dnipro” cline: the Core Yamnaya are on one end of a cline, that also includes the Don Yamnaya and Serednii Stih populations, formed by admixture from the “Caucasus-Lower Volga” (CLV) cline of differential admixture of Neolithic Caucasus and BPgroup people. The CLV Cline includes diverse people buried in kurgans at Berezhnovka, Progress-2, Remontnoye, and Maikop sites Klady and Dlinnaya-Polyana ~5000–3000 BCE. (f) “West Asian”: CLV ancestry first appears in the Chalcolithic population at Areni-1 in Armenia and is also present in the Bronze Age at Maikop. The majority of the ancestry is from West Asian sources from the Mesopotamia-Caucasus (or Çayönü-Masis Blur-Aknashen) cline. Chalcolithic and Bronze Age Anatolians lack CLV ancestry but traces of it can be found in Bronze Age Central Anatolians.
Figure 3.
Figure 3.. Patrilineal succession.
Temporal distribution of key Y-chromosome haplogroups from Kazakhstan, Kyrgyzstan, Mongolia, Russia, Turkmenistan, Ukraine, Uzbekistan, and comparative regions of Europe and West Asia 6000–1000 BCE. The Early and Middle Bronze Age group includes the Yamnaya, Afanasievo, Poltavka, Catacomb, Chemurchek, and North Caucasus cultures; the Middle and Late Bronze Age group individuals of diverse cultures down to 1000 BCE including those of the Sintashta, Andronovo, Potapovka, and Srubnaya cultures. Information on which individuals are plotted can be found in Online Table 6.
Figure 4:
Figure 4:. Trajectory of the Yamnaya expansion.
We use HapNe-LD to estimate the changes in effective population size over time of Yamnaya ancestors, performing the computation separately for the individuals from the earlier three hundred years of our sampling, and the later three hundred years; shading shows confidence intervals (dark: 50%, light: 95%). Jointly displaying these two trajectories reveals an extraordinary population expansion after 3642–3374 BCE (intersection of 95% confidence intervals for the two analyses for the minimum), from when the effective size is a few thousand to an order of magnitude larger. The offset on the x-axis is due to the difference in sampling time between the two groups.
Figure 5:
Figure 5:. IBD analysis of the Yamnaya and their predecessors.
Pairs of individuals linked by at least one IBD segment ≥20cM in length reveal a sparse but highly connected network in the Pre-Yamnaya (Methods) (a) and Yamnaya (b) groups. No detectible IBD is found in the Pre-Yamnaya period beyond the scale of 1000km (c); Yamnaya share more IBD with each other at short distance scales but IBD sharing extends all the way to the ~6000km scale of their geographical distribution (d). However, closely related individuals only occur at short distance scales in both Pre-Yamnaya (e) and Yamnaya (f) groups, indicating that the IBD sharing in the Yamnaya was a legacy of their common origin. In panels (c-f) we display the two-sided 95% confidence interval as a vertical interval (at distance=0) or rectangle (at distance ranges>0) and in red the fraction of dividing the number of pairs of individuals sharing IBD (I)/total number of pairs of individuals (T). (g) In a set of 9 Yamnaya cemeteries, and a total of 25 kurgans, closely or distantly related individuals are virtually absent in inter-cemetery comparisons, more are found in inter-kurgan/within-cemetery comparisons, and more still in intra-kurgan comparisons; nonetheless, most Yamnaya individuals in all comparisons were unrelated. Kurgan burial of close kin was less common than in the case of a local patrilineal dynasty as at a Neolithic long cairn at Neolithic Hazleton North, but more common than in Neolithic monuments of Ireland. Two-sided 95% confidence intervals are shown.

Update of

  • The Genetic Origin of the Indo-Europeans.
    Lazaridis I, Patterson N, Anthony D, Vyazov L, Fournier R, Ringbauer H, Olalde I, Khokhlov AA, Kitov EP, Shishlina NI, Ailincăi SC, Agapov DS, Agapov SA, Batieva E, Bauyrzhan B, Bereczki Z, Buzhilova A, Changmai P, Chizhevsky AA, Ciobanu I, Constantinescu M, Csányi M, Dani J, Dashkovskiy PK, Évinger S, Faifert A, Flegontov PN, Frînculeasa A, Frînculeasa MN, Hajdu T, Higham T, Jarosz P, Jelínek P, Khartanovich VI, Kirginekov EN, Kiss V, Kitova A, Kiyashko AV, Koledin J, Korolev A, Kosintsev P, Kulcsár G, Kuznetsov P, Magomedov R, Malikovich MA, Melis E, Moiseyev V, Molnár E, Monge J, Negrea O, Nikolaeva NA, Novak M, Ochir-Goryaeva M, Pálfi G, Popovici S, Rykun MP, Savenkova TM, Semibratov VP, Seregin NN, Šefčáková A, Serikovna MR, Shingiray I, Shirokov VN, Simalcsik A, Sirak K, Solodovnikov KN, Tárnoki J, Tishkin AA, Trifonov V, Vasilyev S, Akbari A, Brielle ES, Callan K, Candilio F, Cheronet O, Curtis E, Flegontova O, Iliev L, Kearns A, Keating D, Lawson AM, Mah M, Micco A, Michel M, Oppenheimer J, Qiu L, Noah Workman J, Zalzala F, Szécsényi-Nagy A, Palamara PF, Mallick S, Rohland N, Pinhasi R, Reich D. Lazaridis I, et al. bioRxiv [Preprint]. 2024 Apr 18:2024.04.17.589597. doi: 10.1101/2024.04.17.589597. bioRxiv. 2024. Update in: Nature. 2025 Mar;639(8053):132-142. doi: 10.1038/s41586-024-08531-5. PMID: 38659893 Free PMC article. Updated. Preprint.

References

    1. Jones ER et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians. Nat Commun 6, 8912, doi: 10.1038/ncomms9912 (2015). - DOI - PMC - PubMed
    1. Haak W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211, doi: 10.1038/nature14317 (2015). - DOI - PMC - PubMed
    1. Mathieson I. et al. The genomic history of southeastern Europe. Nature 555, 197–203, doi: 10.1038/nature25778 (2018). - DOI - PMC - PubMed
    1. Kloekhorst A. in The Indo-European Puzzle Revisited: Integrating Archaeology, Genetics, and Linguistics (eds Eske Willerslev, Guus Kroonen, & Kristian Kristiansen) 42–60 (Cambridge University Press, 2023).
    1. Kroonen G, Barjamovic G. & Peyrot M. Linguistic supplement to Damgaard et al. 2018 : Early Indo-European languages, Anatolian, Tocharian and Indo-Iranian. (2018). <https://zenodo.org/doi/10.5281/zenodo.1240523>. - DOI

Publication types

LinkOut - more resources