Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 1;13(11):jkad184.
doi: 10.1093/g3journal/jkad184.

Model Organism Modifier (MOM): a user-friendly Galaxy workflow to detect modifiers from genome sequencing data using Caenorhabditis elegans

Affiliations

Model Organism Modifier (MOM): a user-friendly Galaxy workflow to detect modifiers from genome sequencing data using Caenorhabditis elegans

Tatiana Maroilley et al. G3 (Bethesda). .

Abstract

Genetic modifiers are variants modulating phenotypic outcomes of a primary detrimental variant. They contribute to rare diseases phenotypic variability, but their identification is challenging. Genetic screening with model organisms is a widely used method for demystifying genetic modifiers. Forward genetics screening followed by whole genome sequencing allows the detection of variants throughout the genome but typically produces thousands of candidate variants making the interpretation and prioritization process very time-consuming and tedious. Despite whole genome sequencing is more time and cost-efficient, usage of computational pipelines specific to modifier identification remains a challenge for biological-experiment-focused laboratories doing research with model organisms. To facilitate a broader implementation of whole genome sequencing in genetic screens, we have developed Model Organism Modifier or MOM, a pipeline as a user-friendly Galaxy workflow. Model Organism Modifier analyses raw short-read whole genome sequencing data and implements tailored filtering to provide a Candidate Variant List short enough to be further manually curated. We provide a detailed tutorial to run the Galaxy workflow Model Organism Modifier and guidelines to manually curate the Candidate Variant Lists. We have tested Model Organism Modifier on published and validated Caenorhabditis elegans modifiers screening datasets. As whole genome sequencing facilitates high-throughput identification of genetic modifiers in model organisms, Model Organism Modifier provides a user-friendly solution to implement the bioinformatics analysis of the short-read datasets in laboratories without expertise or support in Bioinformatics.

Keywords: Caenorhabditis elegans; bioinformatics pipeline; galaxy; genetic screening; modifiers; short-read whole genome sequencing.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest The author(s) declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
Schematic representation of the main bioinformatics steps implemented in the Galaxy workflow MOM.
Fig. 2.
Fig. 2.
Overview of Galaxy and the workflow webpage. (a) Annotated screenshot of the main page on Galaxy showing on the left the toolshed, and on the right side, the History—displaying jobs status and files. (b) Annotated screenshot of the window allowing uploading files into a History on Galaxy. (c) Annotated screenshot of the Galaxy Workflow MOM main page displaying all necessary input files.
Fig. 3.
Fig. 3.
Overview of the Galaxy history. (a) Annotated screenshot of a History in Galaxy showing the different settings available to the user. (b) Annotated screenshot of a History in Galaxy showing some of the different statuses that a job can display: pending, running, done.
Fig. 4.
Fig. 4.
Overview of the IGV and how to use it for manual curation through visualization. (a) Annotated screenshot of the Search bar on IGV. (b) Annotated screenshot of the tracks as displayed on IGV when visualizing WGS datasets. (c) Screenshots of homozygous SNVs of good-quality as visualized on IGV. (d) Screenshots of SNVs of bad quality as visualized on IGV.

Similar articles

Cited by

References

    1. Amrit FRG, Ghazi A. Transcriptomic analysis of C. elegans RNA sequencing data through the tuxedo suite on the galaxy project. J Vis Exp. 2017;122:e55473. doi:10.3791/55473. - DOI - PMC - PubMed
    1. Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010.
    1. Atxaerandio-Landa A, Arrieta-Gisasola A, Laorden L, Bikandi J, Garaizar J, Martinez-Malaxetxebarria I, Martinez-Ballesteros I. A practical bioinformatics workflow for routine analysis of bacterial WGS data. Microorganisms. 2022;10(12):2364. doi:10.3390/microorganisms10122364. - DOI - PMC - PubMed
    1. Batut B, van den Beek M, Doyle MA, Soranzo N. RNA-seq data analysis in galaxy. Methods Mol Biol. 2021;2284:367–392. doi:10.1007/978-1-0716-1307-8_20. - DOI - PubMed
    1. Blankenberg D, Von Kuster G, Bouvier E, Baker D, Afgan E, Stoler N; Galaxy Team; Taylor J, Nekrutenko A. Dissemination of scientific software with galaxy ToolShed. Genome Biol. 2014;15(2):403. doi:10.1186/gb4161. - DOI - PMC - PubMed

Publication types

Grants and funding

LinkOut - more resources