CAPG: comprehensive allopolyploid genotyper
- PMID: 36367243
- PMCID: PMC9825759
- DOI: 10.1093/bioinformatics/btac729
CAPG: comprehensive allopolyploid genotyper
Abstract
Motivation: Genotyping by sequencing is a powerful tool for investigating genetic variation in plants, but many economically important plants are allopolyploids, where homoeologous similarity obscures the subgenomic origin of reads and confounds allelic and homoeologous SNPs. Recent polyploid genotyping methods use allelic frequencies, rate of heterozygosity, parental cross or other information to resolve read assignment, but good subgenomic references offer the most direct information. The typical strategy aligns reads to the joint reference, performs diploid genotyping within each subgenome, and filters the results, but persistent read misassignment results in an excess of false heterozygous calls.
Results: We introduce the Comprehensive Allopolyploid Genotyper (CAPG), which formulates an explicit likelihood to weight read alignments against both subgenomic references and genotype individual allopolyploids from whole-genome resequencing data. We demonstrate CAPG in allotetraploids, where it performs better than Genome Analysis Toolkit's HaplotypeCaller applied to reads aligned to the combined subgenomic references.
Availability and implementation: Code and tutorials are available at https://github.com/Kkulkarni1/CAPG.git.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.
Figures





Similar articles
-
SVJedi: genotyping structural variations with long reads.Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527. Bioinformatics. 2020. PMID: 32437523
-
SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data.Bioinformatics. 2018 Feb 1;34(3):407-415. doi: 10.1093/bioinformatics/btx587. Bioinformatics. 2018. PMID: 29028881
-
muCNV: Genotyping Structural Variants for Population-level Sequencing.Bioinformatics. 2021 Aug 4;37(14):2055–2057. doi: 10.1093/bioinformatics/btab199. Epub 2021 Mar 24. Bioinformatics. 2021. PMID: 33760063 Free PMC article.
-
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug. PLoS Comput Biol. 2015. PMID: 26267278 Free PMC article.
-
Toward better understanding of artifacts in variant calling from high-coverage samples.Bioinformatics. 2014 Oct 15;30(20):2843-51. doi: 10.1093/bioinformatics/btu356. Epub 2014 Jun 27. Bioinformatics. 2014. PMID: 24974202 Free PMC article. Review.
Cited by
-
Demographic history inference and the polyploid continuum.Genetics. 2023 Aug 9;224(4):iyad107. doi: 10.1093/genetics/iyad107. Genetics. 2023. PMID: 37279657 Free PMC article.
References
-
- Altschul S.F. et al. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403–410. - PubMed
-
- Bertioli D.J. et al. (2016) The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut. Nat. Genet., 48, 438–446. - PubMed
-
- Bertioli D.J. et al. (2019) The genome sequence of segmental allotetraploid peanut Arachis hypogaea. Nat. Genet., 51, 877–884. - PubMed
-
- Blischak P.D. et al. (2018) SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data. Bioinformatics, 34, 407–415. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous