Review
Am J Biol Anthropol. 2023 Jul;181(3):454-473.
doi: 10.1002/ajpa.24754. Epub 2023 May 17.

Recruiting a skeleton crew - Methods for simulating and augmenting paleoanthropological data using Monte Carlo based algorithms

Lloyd A Courtenay et al. Am J Biol Anthropol. 2023 Jul.

Abstract

Objectives: Data collection is a major hindrance in many types of analyses in human evolutionary studies. This issue is fundamental when considering the scarcity and quality of fossil data. From this perspective, many research projects are impeded by the amount of data available to perform tasks such as classification and predictive modeling.

Materials and methods: Here we present the use of Monte Carlo based methods for the simulation of paleoanthropological data. Using two datasets containing cross-sectional biomechanical information and geometric morphometric 3D landmarks, we show how synthetic, yet realistic, data can be simulated to enhance each dataset and provide new information with which to perform complex tasks, in particular classification. We additionally present these algorithms in the form of an R library, AugmentationMC. We also use a geometric morphometric dataset to simulate 3D models, and emphasize the power of Machine Teaching, as opposed to Machine Learning.

Results: Our results show how Monte Carlo based algorithms, such as Markov Chain Monte Carlo, are useful for the simulation of morphometric data, providing synthetic yet highly realistic data that statistical testing shows to be equivalent to the original data. We additionally provide a critical overview of bootstrapping techniques, showing how Monte Carlo based methods perform better than bootstrapping because the simulated data are not exact copies of the original sample.
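
The contrast drawn above between bootstrapping and Markov Chain Monte Carlo simulation can be illustrated with a minimal sketch. This is not the AugmentationMC API and not necessarily the algorithm used in the paper: it assumes a single measurement variable, assumes a normal distribution fitted to the sample as the target, and uses a plain random-walk Metropolis sampler. The function names, the measurement values, and the parameters `burn_in` and `step_scale` are all hypothetical.

```python
import math
import random
import statistics


def bootstrap(sample, n, rng):
    """Resample with replacement: every draw is a copy of an original value."""
    return [rng.choice(sample) for _ in range(n)]


def metropolis_normal(sample, n, rng, burn_in=500, step_scale=1.0):
    """Random-walk Metropolis sampler.

    Assumption: the target is a normal distribution whose mean and standard
    deviation are estimated from `sample`. Real morphometric data would need
    a multivariate target.
    """
    mu = statistics.fmean(sample)
    sigma = statistics.stdev(sample)

    def log_target(x):
        # Log-density of the fitted normal, up to an additive constant.
        return -0.5 * ((x - mu) / sigma) ** 2

    x = mu  # start the chain at the sample mean
    draws = []
    for i in range(burn_in + n):
        proposal = x + rng.gauss(0.0, step_scale * sigma)
        log_ratio = log_target(proposal) - log_target(x)
        # Accept with probability min(1, target(proposal) / target(x)).
        if rng.random() < math.exp(min(0.0, log_ratio)):
            x = proposal
        if i >= burn_in:
            draws.append(x)
    return draws


rng = random.Random(42)
# Made-up measurements standing in for a small fossil sample.
original = [rng.gauss(25.0, 3.0) for _ in range(15)]

boot = bootstrap(original, 200, rng)
synthetic = metropolis_normal(original, 200, rng)

# Bootstrapped values never leave the set of original observations,
# whereas the Metropolis chain produces values not present in the sample.
print("bootstrap only copies originals:", set(boot) <= set(original))
print("new values from MCMC:", len(set(synthetic) - set(original)))
```

Under these assumptions the bootstrap sample contains only repeated originals, while the chain yields new values scattered around the sample mean. Checking that such values are statistically equivalent to the original data, as the abstract describes, would be a separate step not shown here.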

Discussion: While synthetic datasets should never replace large and real datasets, the methods presented here can be considered an important advance in how paleoanthropological data can be handled.

Keywords: 3D model simulation; Markov chain Monte Carlo; data augmentation; geometric morphometrics; machine teaching.
