Recruiting a skeleton crew-Methods for simulating and augmenting paleoanthropological data using Monte Carlo based algorithms
- PMID: 37199044
- DOI: 10.1002/ajpa.24754
Recruiting a skeleton crew-Methods for simulating and augmenting paleoanthropological data using Monte Carlo based algorithms
Abstract
Objectives: Data collection is a major hindrance in many types of analyses in human evolutionary studies. This issue is fundamental when considering the scarcity and quality of fossil data. From this perspective, many research projects are impeded by the amount of data available to perform tasks such as classification and predictive modeling.
Materials and methods: Here we present the use of Monte Carlo based methods for the simulation of paleoanthropological data. Using two datasets containing cross-sectional biomechanical information and geometric morphometric 3D landmarks, we show how synthetic, yet realistic, data can be simulated to enhance each dataset, and provide new information with which to perform complex tasks with, in particular classification. We additionally present these algorithms in the form of an R library; AugmentationMC. We also use a geometric morphometric dataset to simulate 3D models, and emphasize the power of Machine Teaching, as opposed to Machine Learning.
Results: Our results show how Monte Carlo based algorithms, such as the Markov Chain Monte Carlo, are useful for the simulation of morphometric data, providing synthetic yet highly realistic data that has been tested statistically to be equivalent to the original data. We additionally provide a critical overview of bootstrapping techniques, showing how Monte Carlo based methods perform better than bootstrapping as the data simulated is not an exact copy of the original sample.
Discussion: While synthetic datasets should never replace large and real datasets, this can be considered an important advance in how paleoanthropological data can be handled.
Keywords: 3D model simulation; Markov chain Monte Carlo; data augmentation; geometric morphometrics; machine teaching.
© 2023 Wiley Periodicals LLC.
Similar articles
-
Parametric and nonparametric population methods: their comparative performance in analysing a clinical dataset and two Monte Carlo simulation studies.Clin Pharmacokinet. 2006;45(4):365-83. doi: 10.2165/00003088-200645040-00003. Clin Pharmacokinet. 2006. PMID: 16584284
-
Parallel Markov chain Monte Carlo - bridging the gap to high-performance Bayesian computation in animal breeding and genetics.Genet Sel Evol. 2012 Sep 25;44(1):29. doi: 10.1186/1297-9686-44-29. Genet Sel Evol. 2012. PMID: 23009363 Free PMC article.
-
Alternative methods for H1 simulations in genome-wide association studies.Hum Hered. 2012;73(2):95-104. doi: 10.1159/000336194. Epub 2012 Mar 28. Hum Hered. 2012. PMID: 22472690
-
Metropolis sampling in pedigree analysis.Stat Methods Med Res. 1993;2(3):263-82. doi: 10.1177/096228029300200305. Stat Methods Med Res. 1993. PMID: 8261261 Review.
-
Accelerating MCMC algorithms.Wiley Interdiscip Rev Comput Stat. 2018 Sep-Oct;10(5):e1435. doi: 10.1002/wics.1435. Epub 2018 Jun 13. Wiley Interdiscip Rev Comput Stat. 2018. PMID: 30167072 Free PMC article. Review.
References
REFERENCES
-
- Achlioptas, P., Diamanti, O., Mitliagkas, I., & Guibas, L. (2018). Learning representations and generative models for 3D point clouds. International Conference on Learning Representations. https://arxiv.org/abs/1707.02392
-
- Albrecht, G. H. (1992). Assessing the affinities of fossils using canonical variates and generalised distances. Journal of Human Evolution, 7, 49-69.
-
- Alemseged, Z., Spoor, F., Kimbel, W. H., Bobe, R., Geraads, D., Reed, D., & Wynn, J. G. (2006). A juvenile early hominin skeleton from Dikika, Ethiopia. Nature, 443, 296-301.
-
- Almécija, S., Tallman, M., Alba, D. M., Pina, M., Moyà-Solà, S., & Jungers, W. L. (2013). The femur of Orrorin tugenensis exhibits morphometric affinities with both Miocene apes and later hominins. Nature Communications, 4, 2888.
-
- Almécija, S., Tallman, M., Sallam, H. M., Fleagle, J. G., Hammond, A. S., & Seiffert, E. R. (2019). Early anthropoid femora reveal divergent adaptive trajectories in catarrhine hind-limb evolution. Nature Communications, 10, 4778.