BMC Bioinformatics. 2021 Nov 22;22(1):559.
doi: 10.1186/s12859-021-04470-4.

trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios

Dustin B Miller et al. BMC Bioinformatics. 2021.

Abstract

Background: When analyzing DNA sequence data of an individual, knowing which nucleotide was inherited from each parent can be beneficial when trying to identify certain types of DNA variants. Mendelian inheritance logic can be used to accurately phase (haplotype) the majority (67-83%) of an individual's heterozygous nucleotide positions when genotypes are available for both parents (trio). However, when all members of a trio are heterozygous at a position, Mendelian inheritance logic cannot be used to phase. For such positions, a computational phasing algorithm can be used. Existing phasing algorithms use a haplotype reference panel, sequencing reads, and/or parental genotypes to phase an individual; however, they are limited in that they can only phase certain types of variants, require a specific genotype build, require large amounts of storage capacity, and/or require long run times. We created trioPhaser to address these challenges.
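As an illustration of the Mendelian inheritance logic described above, the following minimal Python sketch phases a single position of a trio. It is not the trioPhaser implementation: the function name `phase_by_inheritance` and the representation of a genotype as a pair of allele strings are assumptions made for this example. A position is phased only when exactly one assignment of the child's alleles to the two parents is consistent with the parental genotypes.

```python
from typing import Optional, Tuple

Genotype = Tuple[str, str]  # unordered pair of alleles, e.g. ("A", "G")


def phase_by_inheritance(
    child: Genotype, father: Genotype, mother: Genotype
) -> Optional[Tuple[str, str]]:
    """Return (paternal_allele, maternal_allele) for the child, or None.

    Hypothetical helper for illustration only. None is returned when the
    position is ambiguous (for example, all three trio members are
    heterozygous for the same alleles) or when the genotypes are not
    consistent with Mendelian inheritance.
    """
    a, b = child
    # Try both ways of assigning the child's two alleles to the parents.
    candidates = {
        (paternal, maternal)
        for paternal, maternal in ((a, b), (b, a))
        if paternal in father and maternal in mother
    }
    # Exactly one consistent assignment means the position can be phased.
    if len(candidates) == 1:
        return candidates.pop()
    return None


# Father can only transmit "A", so the child's "G" must be maternal.
resolved = phase_by_inheritance(("A", "G"), ("A", "A"), ("A", "G"))

# All three members are heterozygous A/G: both assignments are possible.
ambiguous = phase_by_inheritance(("A", "G"), ("A", "G"), ("A", "G"))
```

In this sketch, `resolved` holds the paternal allele first and the maternal allele second, while `ambiguous` is `None`, which corresponds to the case the abstract says must be handed to a computational phasing algorithm.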

Results: trioPhaser uses gVCF files from an individual and their parents as initial input, and then outputs a phased VCF file. Input trio data are first phased using Mendelian inheritance logic. Then, the positions that cannot be phased using inheritance information alone are phased by the SHAPEIT4 phasing algorithm. Using whole-genome sequencing data of 52 trios, we show that trioPhaser, on average, increases the total number of phased positions by 21.0% and 10.5%, respectively, when compared to the number of positions that SHAPEIT4 or Mendelian inheritance logic can phase when either is used alone. In addition, we show that the accuracy of the phased calls output by trioPhaser is similar to that of linked-read and read-backed phasing.
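The two-stage order described above (inheritance logic first, a second phaser only for what remains) can be sketched as follows. This is a simplified illustration under stated assumptions, not the trioPhaser code: `mendelian_phase`, `phase_trio`, and the `fallback` callable are hypothetical names, the fallback is a placeholder and does not invoke SHAPEIT4, and positions are plain in-memory dictionaries rather than gVCF or VCF records.

```python
from typing import Callable, Dict, Optional, Tuple

Genotype = Tuple[str, str]                  # unordered pair of alleles
Trio = Tuple[Genotype, Genotype, Genotype]  # (child, father, mother)
Phased = Tuple[str, str]                    # (paternal, maternal)


def mendelian_phase(child: Genotype, father: Genotype, mother: Genotype) -> Optional[Phased]:
    """Phase one position by inheritance, or return None if not resolvable."""
    a, b = child
    options = {(p, m) for p, m in ((a, b), (b, a)) if p in father and m in mother}
    return options.pop() if len(options) == 1 else None


def phase_trio(
    positions: Dict[int, Trio],
    fallback: Callable[[int, Genotype], Optional[Phased]],
) -> Tuple[Dict[int, Phased], Dict[int, str]]:
    """Apply inheritance logic first, then a placeholder second-stage phaser.

    Returns the phased calls and, per position, which stage produced them.
    Positions that neither stage can resolve are left out of both results.
    """
    phased: Dict[int, Phased] = {}
    source: Dict[int, str] = {}
    for pos in sorted(positions):
        child, father, mother = positions[pos]
        call = mendelian_phase(child, father, mother)
        stage = "mendelian"
        if call is None:
            # Assumption: everything unresolved goes to the second stage.
            call = fallback(pos, child)
            stage = "fallback"
        if call is not None:
            phased[pos] = call
            source[pos] = stage
    return phased, source


example_positions = {
    100: (("A", "G"), ("A", "A"), ("G", "G")),  # resolvable by inheritance
    200: (("C", "T"), ("C", "T"), ("C", "T")),  # all heterozygous: ambiguous
}
# Toy stand-in for a statistical phaser: keeps the input allele order.
phased_calls, call_source = phase_trio(example_positions, lambda pos, gt: gt)
```

Recording which stage produced each call makes it straightforward to count how many positions each stage contributes, which is the kind of comparison the Results paragraph reports.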

Conclusion: trioPhaser is a containerized software tool that uses both Mendelian inheritance logic and SHAPEIT4 to phase trios when gVCF files are available. By implementing both phasing methods, trioPhaser phases more variant positions than either method can phase alone.

Keywords: Genomics; Haplotyping; Next-generation sequencing; Phasing; Trios.

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
trioPhaser workflow diagram. All steps are conducted within a Docker container. Input files are shown in red and the output file is shown in blue. Steps 1–5 each produce temporary output files, which are used as input to the subsequent step.
