Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 25;20(1):14.
doi: 10.1186/s13015-025-00274-w.

NANUQ+: A divide-and-conquer approach to network estimation

Affiliations

NANUQ+: A divide-and-conquer approach to network estimation

Elizabeth S Allman et al. Algorithms Mol Biol. .

Abstract

Inference of a species network from genomic data remains a difficult problem, with recent progress mostly limited to the level-1 case. However, inference of the Tree of Blobs of a network, showing only the network's cut edges, can be performed for any network by TINNiK, suggesting a divide-and-conquer approach to network inference where the tree's multifurcations are individually resolved to give more detailed structure. Here we develop a method, NANUQ + , to quickly perform such a level-1 resolution. Viewed as part of the NANUQ pipeline for fast level-1 inference, this gives tools for both understanding when the level-1 assumption is likely to be met and for exploring all highly-supported resolutions to cycles.

Keywords: Level-1; Multispecies coalescent; Phylogenetic network; Tree of blobs.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: Not applicable.

Figures

Fig. 1
Fig. 1
A topological rooted binary phylogenetic network N+ (left) and its induced topological unrooted phylogenetic network N- (right). In N+ only hybrid edges are highlighted as directed, but all other edges are implicitly assumed to be directed away from the root. In N- all non-hybrid edges are undirected
Fig. 2
Fig. 2
The five quartet networks induced from the topological unrooted phylogenetic network N- of Fig. 1
Fig. 3
Fig. 3
An unrooted 7-sunlet on X={x1,,x7} with hybrid taxon x1 and taxa ordered counter-clockwise around the cycle
Fig. 4
Fig. 4
A network N- containing eight trivial and two non-trivial blobs (all shaded in gray) (top) and its tree of blobs T(N) (bottom). Note that T(N) contains two nodes of degree four arising from N-’s 4-blobs
Fig. 5
Fig. 5
An unrooted binary topological level-1 network N- on X={x1,,x10} containing two non-trivial cycle blobs (left) and the generalized sunlet network NC- induced by the 5-cycle C in N- (right). Here, X1={x1,x2}, X2={x4}, X3={x5}, X4={x6,x7,x10}, and X5={x3}
Algorithm 1
Algorithm 1
ResolveCycle
Algorithm 2
Algorithm 2
HeuristicResolveCycle, m>4
Algorithm 3
Algorithm 3
CombineCycles
Fig. 6
Fig. 6
A rooted level-1 network N+ (left), and the tree of blobs T(N) obtained from the unrooted network N- (right). Note that T(N) has four multifurcations, labeled b1,b2,b3, and b4, corresponding to the 4 cycles of N+
Fig. 7
Fig. 7
NANUQ splits graph and TINNiK tree of blobs for k=1.0, s=10000 gene trees, using α=10-6 and β=0.05. Notice that the NANUQ splits graph suggests a level-1 structure since the four blob structures are ‘darts,’ and the TINNiK tree of blobs matches the true tree of blobs depicted in Fig. 6
Fig. 8
Fig. 8
Resolutions of the four multifurcations of the tree of blobs in T(N) in Fig. 6 combined into a level-1 network for s=10000 gene trees and k=1.0 for α=10-6 and β=0.05 when using the true tree of blobs and the NANUQ distance. Internal node labels #Hi and THi indicate hybrid nodes and the tail of one hybrid edge, respectively
Fig. 9
Fig. 9
Resolutions of the four multifurcations in the true tree of blobs in T(N) in Fig. 6 for s=10000 gene trees and k=1.0 and with test levels α=10-26 and β=0.05, using the NANUQ distance. There is a 2-way tie for the best resolution of b1, with the alternatives shown in the top row. The unique resolutions for the three other high-degree multifurcations in T(N) are shown in row 2. Since resolving node 18 required creation of a 4-cycle, the hybrid node was placed at random (row 2, column 1). Internal node labels #Hi and THi indicate hybrid nodes and the tail of one hybrid edge
Fig. 10
Fig. 10
Simplex plot (left) and splits graph (center) produced by NANUQ for Leopardus data, with tests levels α=5·10-29, β=0.95. TINNiK tree of blobs (right) for the same data and test levels. Red triangles in the simplex plot indicate empirical quartet concordance factors judged as involving hybridization. In the splits graph, the blue blob b1 is a dart, consistent with the level-1 assumption underlying NANUQ, but the gold blob b2 is not. TINNiK makes no assumptions about underlying blob complexity
Fig. 11
Fig. 11
Residual-sums-of-squares for all 60 possible resolutions of nodes b1 (left) and b2 (right). For b1 there is a single optimal resolution, while for b2 there are five tied optimal resolutions
Fig. 12
Fig. 12
The five NANUQ+ level-1 networks obtained by resolving two degree-5 multifurcations in the TINNiK tree of blobs. The hybridization cycle resolving node b1 on the tree of blobs is consistent with a level-1 hypothesis and is present in all five optimal NANUQ+ networks. For node b2 there are five optimal cycle resolutions in the NANUQ+ framework, with the hybrid node placed at each of the articulation nodes of the cycle. Log-quartet-pseudolikelihood scores for each network are shown, with the optimal network reported in [11] at top, left

Similar articles

Cited by

References

    1. Kong S, Swofford DL, Kubatko LS. Inference of phylogenetic networks from sequence data using composite likelihood. Syst Biol. 2024. 10.1093/sysbio/syae054. - PubMed
    1. Wen D, Nakhleh L. Coestimating reticulate phylogenies and gene trees from multilocus sequence data. Syst Biol. 2017;67(3):439–57. 10.1093/sysbio/syx085. - PubMed
    1. Zhang C, Ogilvie HA, Drummond AJ, Stadler T. Bayesian inference of species networks from multilocus sequence data. Mol Biol Evol. 2017;35(2):504–17. 10.1093/molbev/msx307. - PMC - PubMed
    1. Solís-Lemus C, Ané C. Inferring phylogenetic networks with maximum pseudolikelihood under incomplete lineage sorting. PLoS Genet. 2016;12(3): e1005896. 10.1371/journal.pgen.1005896. - PMC - PubMed
    1. Yu Y, Nakhleh L. A maximum pseudo-likelihood approach for phylogenetic networks. BMC Genom. 2015. 10.1186/1471-2164-16-s10-s10. - PMC - PubMed

LinkOut - more resources