Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 6;5(1):vbaf104.
doi: 10.1093/bioadv/vbaf104. eCollection 2025.

Poplar: a phylogenomics pipeline

Affiliations

Poplar: a phylogenomics pipeline

Elizabeth Koning et al. Bioinform Adv. .

Abstract

Motivation: Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.

Results: We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.

Availability and implementation: Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1.
Figure 1.
Illustration of the Poplar pipeline from input sequence to species tree.
Figure 2.
Figure 2.
Reference tree of Kickxellomycotina genomes, according to JGI’s MycoCosm.
Figure 3.
Figure 3.
Poplar generated tree of Kickxellomycotina genomes with triplet error rate of 0.594, according to one run of Poplar with 50 gene trees given the coding sequences.
Figure 4.
Figure 4.
Reference Rhodophyta tree from JGI’s MycoCosm.
Figure 5.
Figure 5.
Rhodophyta tree generated by Poplar with 200 gene trees given coding sequences as input.
Figure 6.
Figure 6.
Reference tree for NCBI commonly studied organisms, from NCBI.
Figure 7.
Figure 7.
Tree generated by Poplar from the NCBI genomes given 200 gene trees and coding sequence input.
Figure 8.
Figure 8.
Puccinia graminis reference tree from MycoCosm (top) and Poplar with 100 or 500 gene trees (bottom).
Figure 9.
Figure 9.
Graph of CPU usage during execution of Poplar on 14 Kickxellomycotina genomes using 500 gene trees.

Similar articles

References

    1. Ahrendt SR, Quandt CA, Ciobanu D et al. Leveraging single-cell genomics to expand the fungal tree of life. Nat Microbiol 2018;3:1417–28. - PMC - PubMed
    1. Altschul SF, Gish W, Miller W et al. Basic local alignment search tool. J Mol Biol 1990;215:403–10. - PubMed
    1. Amses KR, Simmons DR, Longcore Stephen JE et al. Diploid-dominant life cycles characterize the early evolution of fungi. Proc Natl Acad Sci USA 2022;119:e2116841119. - PMC - PubMed
    1. Armstrong J, Hickey G, Diekhans M et al. Progressive cactus is a multiple-genome aligner for the thousand-genome era. Nature 2020;587:246–51. 10.1038/s41586-020-2871-y - DOI - PMC - PubMed
    1. Assembly. Accession No. GCF_000004695.1, Dictyostelium discoideum. National Library of Medicine (US): National Center for Biotechnology Information, 2009. https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000004695.1.

LinkOut - more resources