Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 24;376(1825):20200157.
doi: 10.1098/rstb.2020.0157. Epub 2021 Apr 5.

MolluscDB: a genome and transcriptome database for molluscs

Affiliations

MolluscDB: a genome and transcriptome database for molluscs

Carlos Caurcel et al. Philos Trans R Soc Lond B Biol Sci. .

Abstract

As sequencing becomes more accessible and affordable, the analysis of genomic and transcriptomic data has become a cornerstone of many research initiatives. Communities with a focus on particular taxa or ecosystems need solutions capable of aggregating genomic resources and serving them in a standardized and analysis-friendly manner. Taxon-focussed resources can be more flexible in addressing the needs of a research community than can universal or general databases. Here, we present MolluscDB, a genome and transcriptome database for molluscs. MolluscDB offers a rich ecosystem of tools, including an Ensembl browser, a BLAST server for homology searches and an HTTP server from which any dataset present in the database can be downloaded. To demonstrate the utility of the database and verify the quality of its data, we imported data from assembled genomes and transcriptomes of 22 species, estimated the phylogeny of Mollusca using single-copy orthologues, explored patterns of gene family size change and interrogated the data for biomineralization-associated enzymes and shell matrix proteins. MolluscDB provides an easy-to-use and openly accessible data resource for the research community. This article is part of the Theo Murphy meeting issue 'Molluscan genomics: broad insights and future directions for a neglected phylum'.

Keywords: database; genome; molluscs; protein families; shell matrix proteins; transcriptome.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Protein families in species represented in MolluscDB. (a) Stacked histogram of proteins in each taxon analysed assigned to: ‘shared’: proteins in clusters containing proteins from multiple taxa; ‘specific’: proteins in clusters containing two or more proteins from a single proteome; and ‘singleton’: proteins in single protein clusters. (b) Frequency plot of cluster size in the OrthoFinder clustering of 214 608 orthogroups. (Online version in colour.)
Figure 2.
Figure 2.
Phylogenetic tree of taxa in MolluscDB. Multilocus phylogeny of the species analysed. Support is 100 at all nodes except indicated.

References

    1. Muir P, et al. . 2016. The real cost of sequencing: scaling computation to keep pace with data generation. Genome Biol. 17, 53. (10.1186/s13059-016-0917-0) - DOI - PMC - PubMed
    1. Gomes-dos-Santos A, Lopes-Lima M, Castro LFC, Froufe E. 2020. Molluscan genomics: the road so far and the way forward. Hydrobiologia 847, 1705-1726. (10.1007/s10750-019-04111-1) - DOI
    1. González VL, Andrade SCS, Bieler R, Collins TM, Dunn CW, Mikkelsen PM, Taylor JD, Giribet G. 2015. A phylogenetic backbone for Bivalvia: an RNA-seq approach. Proc. R. Soc. B 282, 20142332. (10.1098/rspb.2014.2332) - DOI - PMC - PubMed
    1. Kocot KM, et al. . 2017. Phylogenomics of Lophotrochozoa with consideration of systematic error. Syst. Biol. 66, 256-282. (10.1093/sysbio/syw079) - DOI - PubMed
    1. Yates AD, et al. . 2020. Ensembl 2020. Nucleic Acids Res. 48, D682-D688. (10.1093/nar/gkz1138) - DOI - PMC - PubMed

Publication types