Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 3;24(1):415.
doi: 10.1186/s12859-023-05554-z.

Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data

Affiliations

Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data

Thomas A Delomas et al. BMC Bioinformatics. .

Abstract

Background: Microhaplotypes have the potential to be more cost-effective than SNPs for applications that require genetic panels of highly variable loci. However, development of microhaplotype panels is hindered by a lack of methods for estimating microhaplotype allele frequency from low-coverage whole genome sequencing or pooled sequencing (pool-seq) data.

Results: We developed new methods for estimating microhaplotype allele frequency from low-coverage whole genome sequence and pool-seq data. We validated these methods using datasets from three non-model organisms. These methods allowed estimation of allele frequency and expected heterozygosity at depths routinely achieved from pooled sequencing.

Conclusions: These new methods will allow microhaplotype panels to be designed using low-coverage WGS and pool-seq data to discover and evaluate candidate loci. The python script implementing the two methods and documentation are available at https://www.github.com/delomast/mhFromLowDepSeq .

Keywords: Genotype panel design; Low-depth whole genome sequencing; Microhaplotype; Pool-seq; Skim-seq.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Distribution of error in estimated allele frequencies from the oyster dataset binned by the number of reads contributing to the estimate. Sample sizes in bins from left to right: 194,204, 125,410, 54,818, 81,027, 96,920, 90,745, 75,136, 63,388, 113,454, 97,540
Fig. 2
Fig. 2
Mean square error in estimated allele frequencies from the oyster dataset binned by the number of reads contributing to the estimate. Sample sizes in bins from left to right: 194,204, 125,410, 54,818, 81,027, 96,920, 90,745, 75,136, 63,388, 113,454, 97,540
Fig. 3
Fig. 3
Distribution of error in expected heterozygosity from the oyster dataset binned by the number of reads contributing to the estimate. Sample sizes in bins from left to right: 194,204, 125,410, 54,818, 81,027, 96,920, 90,745, 75,136, 63,388, 113,454, 97,540
Fig. 4
Fig. 4
Mean square error in expected heterozygosity from the oyster dataset binned by the number of reads contributing to the estimate. Sample sizes in bins from left to right: 194,204, 125,410, 54,818, 81,027, 96,920, 90,745, 75,136, 63,388, 113,454, 97,540

References

    1. Kayser M, de Knijff P. Improving human forensics through advances in genetics, genomics and molecular biology. Nat Rev Genet. 2011;12(3):179–192. doi: 10.1038/nrg2952. - DOI - PubMed
    1. Weller JI, Ezra E, Ron M. Invited review: a perspective on the future of genomic selection in dairy cattle. J Dairy Sci. 2017;100(11):8633–8644. doi: 10.3168/jds.2017-12879. - DOI - PubMed
    1. Hollenbeck CM, Johnston IA. Genomic tools and selective breeding in molluscs. Front Genet. 2018;9:253. doi: 10.3389/fgene.2018.00253. - DOI - PMC - PubMed
    1. Hargrove JS, Camacho CA, Schrader WC, Powell JH, Delomas TA, Hess JE, et al. Parentage-based tagging improves escapement estimates for ESA-listed adult Chinook salmon and steelhead in the Snake River basin. Can J Fish Aquat Sci. 2021;78(4):349–360. doi: 10.1139/cjfas-2020-0152. - DOI
    1. Howe R, Miron-Shatz T, Hanoch Y, Omer ZB, O’Donoghue C, Ozanne EM. Personalized medicine through SNP testing for breast cancer risk: Clinical implementation. J Genet Couns. 2015;24(5):744–751. doi: 10.1007/s10897-014-9803-7. - DOI - PubMed

MeSH terms

LinkOut - more resources