Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Apr 30;197(2):315-23.
doi: 10.1016/j.jneumeth.2011.02.014. Epub 2011 Mar 3.

Profiling a Caenorhabditis elegans behavioral parametric dataset with a supervised K-means clustering algorithm identifies genetic networks regulating locomotion

Affiliations

Profiling a Caenorhabditis elegans behavioral parametric dataset with a supervised K-means clustering algorithm identifies genetic networks regulating locomotion

Shijie Zhang et al. J Neurosci Methods. .

Abstract

Defining genetic networks underlying animal behavior in a high throughput manner is an important but challenging task that has not yet been achieved for any organism. Using Caenorhabditis elegans, we collected quantitative parametric data related to various aspects of locomotion from wild type and 31 mutant worm strains with single mutations in genes functioning in sensory reception, neurotransmission, G-protein signaling, neuromuscular control or other facets of motor regulation. We applied unsupervised and constrained K-means clustering algorithms to the data and found that the genes that clustered together due to the behavioral similarity of their mutants encoded proteins in the same signaling networks. This approach provides a framework to identify genes and genetic networks underlying worm neuromotor function in a high-throughput manner. A publicly accessible database harboring the visual and quantitative behavioral data collected in this study adds valuable information to the rapidly growing C. elegans databanks that can be employed in a similar context.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic representation of the AQUABN system. Left panel: hardware and software of the AQUABN system. Right panel: workflow chart of the AQUABN software.
Figure 2
Figure 2
Clustering of genes according to the behavioral signature of their genetic mutants. The selected genes segregated into six clusters according to the behavioral signature of their mutants. Seed worm strains are indicated in bold font. While eighteen variants unambiguously fell into a single cluster without supervision, seven variants could fit into two clusters. In these cases, the two involved clusters are connected with lines and the variants are labeled with asterisks. The area of each section and the spatial position of each circular diagram are only for presenting purpose without biological meaning. egl-30 (lf) and egl-30 (gf) are a loss-of-function (md186) or a gain-of-function (js126) allele of egl-30, respectively. We also used two loss-of-function mutations of unc-29 (x29 and e193).
Figure 3
Figure 3
Self-aggregation of worm strains according to the behavioral signature of their genetic mutants using an unsupervised K-means clustering algorithm. The selected genes self-aggregated into five clusters and two isolated genes according to the behavioral signature of their mutants. The area of each section and the spatial position of each circular diagram are only for presenting purposes without biological meaning.
Figure 4
Figure 4
Clustering of genes according to the behavioral signature of their genetic mutants with ACKMCA. The selected genes segregated into six clusters with ACKMCA according to the behavioral signature of their mutants. Seed worm strains are indicated in bold font. The area of each section and the spatial position of each circular diagram are only for presenting purposes without biological meaning.
Figure 5
Figure 5
Effect of redundant/correlated data on supervised clustering genes according to the behavioral signature of their genetic mutants. The selected genes segregated into six clusters with ACKMCA according to the behavioral signature of their mutants. Seed worm strains are indicated in bold font. The area of each section and the spatial position of each circular diagram are only for presenting purposes without biological meaning.
Figure 6
Figure 6
Effect of data size on supervised clustering genes according to the behavioral signature of their genetic mutants. The selected genes segregated into five clusters with ACKMCA according to the behavioral signature of their mutants. Seed worm strains are indicated in bold font. The area of each section and the spatial position of each circular diagram are only for presenting purposes without biological meaning.

References

    1. Baek JH, Cosman P, Feng Z, Silver J, Schafer WR. Using machine vision to analyze and classify Caenorhabditis elegans behavioral phenotypes quantitatively. J Neurosci Methods. 2002;118:9–21. - PubMed
    1. Baruch L, Itzkovitz S, Golan-Mashiach M, Shapiro E, Segal E. Using expression profiles of Caenorhabditis elegans neurons to identify genes that mediate synaptic connectivity. PLoS Comput Biol. 2008;4:e1000120. - PMC - PubMed
    1. Bastiani C, Mendel J. WormBook. 2006. Heterotrimeric G proteins in C. elegans; pp. 1–25. - PMC - PubMed
    1. Bianchi L, Gerstbrein B, Frokjaer-Jensen C, Royal DC, Mukherjee G, et al. The neurotoxic MEC-4(d) DEG/ENaC sodium channel conducts calcium: implications for necrosis initiation. Nat Neurosci. 2004;7:1337–1344. - PubMed
    1. Cao P, Yuan Y, Pehek EA, Moise AR, Huang Y, et al. Alpha-synuclein disrupted dopamine homeostasis leads to dopaminergic neuron degeneration in Caenorhabditis elegans. PLoS ONE. 2010;5:e9312. - PMC - PubMed

Publication types

Substances

LinkOut - more resources