Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2018 May 9;19(Suppl 2):96.
doi: 10.1186/s12864-018-4466-7.

Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes

Affiliations
Comparative Study

Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes

Yoann Anselmetti et al. BMC Genomics. .

Abstract

Background: Genomes rearrangements carry valuable information for phylogenetic inference or the elucidation of molecular mechanisms of adaptation. However, the detection of genome rearrangements is often hampered by current deficiencies in data and methods: Genomes obtained from short sequence reads have generally very fragmented assemblies, and comparing multiple gene orders generally leads to computationally intractable algorithmic questions.

Results: We present a computational method, ADSEQ, which, by combining ancestral gene order reconstruction, comparative scaffolding and de novo scaffolding methods, overcomes these two caveats. ADSEQ provides simultaneously improved assemblies and ancestral genomes, with statistical supports on all local features. Compared to previous comparative methods, it runs in polynomial time, it samples solutions in a probabilistic space, and it can handle a significantly larger gene complement from the considered extant genomes, with complex histories including gene duplications and losses. We use ADSEQ to provide improved assemblies and a genome history made of duplications, losses, gene translocations, rearrangements, of 18 complete Anopheles genomes, including several important malaria vectors. We also provide additional support for a differentiated mode of evolution of the sex chromosome and of the autosomes in these mosquito genomes.

Conclusions: We demonstrate the method's ability to improve extant assemblies accurately through a procedure simulating realistic assembly fragmentation. We study a debated issue regarding the phylogeny of the Gambiae complex group of Anopheles genomes in the light of the evolution of chromosomal rearrangements, suggesting that the phylogenetic signal they carry can differ from the phylogenetic signal carried by gene sequences, more prone to introgression.

Keywords: Comparative genomics; Mosquito genomics; Scaffolding.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Input and output of ADSEQ. (Left) Input data: (1) a species tree with extant genomes (A, B and C) containing observed adjacencies (black link) and scaffolding gene adjacencies with a prior score (blue link); each grey box represents a gene. (2) reconciled gene trees representing evolutionary histories of gene families annotated by evolutionary events. (Right) Typical output: gene order across ancestral and extant genomes including new extant gene adjacencies with a posterior score (green link) between genes located at fragments extremities in the initial genome assemblies
Fig. 2
Fig. 2
Precision and recall statistics for scaffolding adjacencies on three artificially fragmented genomes (A.alb: A. albimanus, A.ara: A. arabiensis and A.dir: A. dirus). Left graph: results with 50% of reads. Right graph: results with all reads. The different methods results are plotted with the precision on the Y axis and the recall on the X axis. For ADSEQ and AD, results for three different adjacency support thresholds (0.1, 0.5 and 0.8) before genome linearization are plotted and represented with a color gradient. Note: A True Positive (TP) adjacency requires the proper orientation of both genes
Fig. 3
Fig. 3
A. species trees (left: X phylogeny, right: WG phylogeny) with rearrangements per adjacency as branch lengths (× 10−3). The pie chart for a given species represents the adjacency degree of the genes of this species: orange represents genes with no adjacency, light blue genes with adjacency degree of 1 and green genes with adjacency degree of 2. Moreover, the diameter of each chart is proportional to the number of genes in the corresponding species
Fig. 4
Fig. 4
Left: Number of segments in extant and ancestral genomes, according to three runs of ADSEQ in three different conditions. In the first run, we turn off the scaffolding mode on the X phylogeny, that is, it only constructs ancestral segments. The first column “XNS ext” thus describes the initial assembly, and “XNS anc” the assembly of ancestral genomes when reconstructed without extant scaffolding. In the second and third runs, the scaffolding mode was turned on, and run with the X phylogeny (“X ext” and “X anc”) and the WG phylogeny (“WG ext” and “WG anc”). Right: Number of rearrangements over all branches of the X phylogeny, with and without the scaffolding mode

References

    1. Zuckerkandl E, Pauling L. Molecules as documents of evolutionary history. J Theor Biol. 1965;8(2):357–66. doi: 10.1016/0022-5193(65)90083-4. - DOI - PubMed
    1. Springer MS, Gatesy J. The gene tree delusion. Mol Phylogenet Evol. 2016;94:1–33. doi: 10.1016/j.ympev.2015.07.018. - DOI - PubMed
    1. Doolittle WF, Brunet TDP. What Is the Tree of Life? PLOS Genet. 2016;12(4):e1005912. doi: 10.1371/journal.pgen.1005912. - DOI - PMC - PubMed
    1. Neafsey DE, Waterhouse RM, Abai MR, Aganezov SS, Alekseyev MA, Allen JE, et al. Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes. Science. 2015;347(6217):1258522. doi: 10.1126/science.1258522. - DOI - PMC - PubMed
    1. Fontaine MC, Pease JB, Steele A, Waterhouse RM, Neafsey DE, Sharakhov IV, et al. Mosquito genomics. Extensive introgression in a malaria vector species complex revealed by phylogenomics. Science. 2015;347(6217):1258524. doi: 10.1126/science.1258524. - DOI - PMC - PubMed

Publication types

LinkOut - more resources