Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 16;114(1):52-59.
doi: 10.1093/jhered/esac054.

Reference genome of the Monkeyface Prickleback, Cebidichthys violaceus

Affiliations

Reference genome of the Monkeyface Prickleback, Cebidichthys violaceus

Daniel B Wright et al. J Hered. .

Abstract

Pricklebacks (Family Stichaeidae) are generally cold-temperate fishes most commonly found in the north Pacific. As part of the California Conservation Genomics Project (CCGP), we sequenced the genome of the Monkeyface Prickleback, Cebidichthys violaceus, to establish a genomic model for understanding phylogeographic patterns of marine organisms in California. These patterns, in turn, may inform the design of marine protected areas using dispersal models based on forthcoming population genomic data. The genome of C. violaceus is typical of many marine fishes at less than 1 Gb (genome size = 575.6 Mb), and our assembly is near-chromosome level (contig N50 = 1 Mb, scaffold N50 = 16.4 Mb, BUSCO completeness = 93.2%). Within the context of the CCGP, the genome will be used as a reference for future whole genome resequencing projects, enhancing our knowledge of the population structure of the species and more generally, the efficacy of marine protected areas as a primary conservation tool across California's marine ecosystems.

Keywords: CCGP; California Conservation Genomics Project; marine protected areas.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
A) Distribution of Monkeyface Prickleback, Cebidichthys violaceus. Monkeyface Pricklebacks are found in the intertidal and subtidal to a depth of 25 m from Southern Oregon, USA, to Northern Baja California, Mexico. The collection site of the sequenced individual, Pacific Grove California, is indicated by the star on the map. Inset is an illustration of C. violaceus (Illustration credit: Andrea Dingeldin). B) A Monkeyface Prickleback, C. violaceus (photo by M.H. Horn) and an image of Franklin Point California at low tide, an example of representative intertidal habitat for the Monkeyface Prickleback. C) Phylogenetic relationships of the nonmonophyletic family Stichaeidae based on 2,100 bp of cytb, 16s, and tomo4c4 genes (Kim et al. 2014). Bayesian posterior probabilities are indicated on nodes. C. violaceus is bolded and the letters after the names are H = herbivory, O = omnivory, C = carnivory. Evolution of herbivory (— — — —) and omnivory (............) are shown. Numbers in parentheses are the number of taxa contained in that branch. Some groups are collapsed into subfamilies. Asterisks indicate that this species (or species in that group) are found in California.
Fig. 2.
Fig. 2.
Visual overview of genome assembly metrics. A) K-mer spectra output generated from PacBio HiFi data without adapters using GenomeScope2.0. B) BlobToolKit Snail plot showing a graphical representation of the quality metrics presented in Table 2 for the Cebidichthys violaceus primary assembly. The plot circle represents the full size of the assembly. From the inside-out, the central plot covers length-related metrics. The red line represents the size of the longest scaffold; all other scaffolds are arranged in size-order moving clockwise around the plot and drawn in gray starting from the outside of the central plot. Dark and light orange arcs show the scaffold N50 and scaffold N90 values. The central light gray spiral shows the cumulative scaffold count with a white line at each order of magnitude. White regions in this area reflect the proportion of Ns in the assembly. The dark versus light blue area around it shows mean, maximum and minimum GC versus AT content at 0.1% intervals (Challis et al. 2020). Omni-C contact maps for the primary (C) and alternate (D) genome assembly generated with PretextSnapshot. Hi-C contact maps translate proximity of genomic regions in 3D space to contiguous linear organization. Each cell in the contact map corresponds to sequencing data supporting the linkage (or join) between 2 of such regions.

References

    1. Abdennur N, Mirny LA.. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics. 2020;36(1):311–316. - PMC - PubMed
    1. Allio R, Schomaker‐Bastos A, Romiguier J, Prosdocimi F, Nabholz B, Delsuc F.. 2020. MitoFinder: Efficient automated large‐scale extraction of mitogenomic data in target enrichment phylogenomics. Mol Ecol Resour. 20(4):892–905. doi:10.1111/1755-0998.13160. - DOI - PMC - PubMed
    1. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL.. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):1–9. - PMC - PubMed
    1. Challis R, Richards E, Rajan J, Cochrane G, Blaxter M.. BlobToolKit—interactive quality assessment of genome assemblies. G3. 2020;10(4):1361–1374. - PMC - PubMed
    1. Cheng H, Jarvis ED, Fedrigo O, Koepfli K-P, Urban L, Gemmell NJ, Li H.. Robust haplotype-resolved assembly of diploid individuals without parental data. 2021. [accessed 2022 Jul 1]. http://arxiv.org/abs/2109.04785 - PMC - PubMed

Publication types