A maximum-likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes
- PMID: 22851645
- PMCID: PMC3660999
- DOI: 10.1534/genetics.112.139519
Abstract
Allelic dropout is a commonly observed source of missing data in microsatellite genotypes, in which one or both allelic copies at a locus fail to be amplified by the polymerase chain reaction. Especially for samples with poor DNA quality, this problem causes a downward bias in estimates of observed heterozygosity and an upward bias in estimates of inbreeding, owing to mistaken classifications of heterozygotes as homozygotes when one of the two copies drops out. One general approach for avoiding allelic dropout involves repeated genotyping of homozygous loci to minimize the effects of experimental error. Existing computational alternatives often require replicate genotyping as well. These approaches, however, are costly and are suitable only when enough DNA is available for repeated genotyping. In this study, we propose a maximum-likelihood approach together with an expectation-maximization algorithm to jointly estimate allelic dropout rates and allele frequencies when only one set of nonreplicated genotypes is available. Our method considers estimates of allelic dropout caused by both sample-specific factors and locus-specific factors, and it allows for deviation from Hardy-Weinberg equilibrium owing to inbreeding. Using the estimated parameters, we correct the bias in the estimation of observed heterozygosity through the use of multiple imputations of alleles in cases where dropout might have occurred. With simulated data, we show that our method can (1) effectively reproduce patterns of missing data and heterozygosity observed in real data; (2) correctly estimate model parameters, including sample-specific dropout rates, locus-specific dropout rates, and the inbreeding coefficient; and (3) successfully correct the downward bias in estimating the observed heterozygosity. We find that our method is fairly robust to violations of model assumptions caused by population structure and by genotyping errors from sources other than allelic dropout. 
Because the data sets imputed under our model can be investigated in additional subsequent analyses, our method will be useful for preparing data for applications in diverse contexts in population genetics and molecular ecology.
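The downward bias described in the abstract can be illustrated with a small calculation. The sketch below is not the authors' model. It assumes a single dropout probability shared by every allelic copy, with copies dropping out independently; the function name `apparent_heterozygosity` and its parameters are hypothetical. Under that assumption, a true heterozygote is still seen as heterozygous only when both copies amplify, is misread as a homozygote when exactly one copy drops out, and is missing when both do.

```python
def apparent_heterozygosity(true_het: float, dropout: float) -> float:
    """Expected heterozygosity among non-missing genotypes at one locus.

    Assumption (not from the paper): each allelic copy drops out
    independently with the same probability `dropout`.
    """
    if not 0.0 <= true_het <= 1.0:
        raise ValueError("true_het must lie in [0, 1]")
    if not 0.0 <= dropout < 1.0:
        raise ValueError("dropout must lie in [0, 1)")

    keep = 1.0 - dropout

    # True heterozygote, both copies amplified: observed as heterozygote.
    het_seen = true_het * keep ** 2

    # Observed as homozygote: either a heterozygote that lost exactly one
    # copy, or a true homozygote that kept at least one copy.
    hom_seen = (true_het * 2.0 * dropout * keep
                + (1.0 - true_het) * (1.0 - dropout ** 2))

    # Genotypes with both copies lost are missing and are excluded.
    return het_seen / (het_seen + hom_seen)


if __name__ == "__main__":
    for rate in (0.0, 0.1, 0.2, 0.4):
        print(rate, round(apparent_heterozygosity(0.7, rate), 4))
```

Under these assumptions the expression simplifies to `true_het * (1 - dropout) / (1 + dropout)`, so any nonzero dropout rate lowers the apparent heterozygosity. This is the direction of bias that the method in the abstract aims to correct.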