Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Feb 22;5(2):e9361.
doi: 10.1371/journal.pone.0009361.

Identifying prototypical components in behaviour using clustering algorithms

Affiliations

Identifying prototypical components in behaviour using clustering algorithms

Elke Braun et al. PLoS One. .

Abstract

Quantitative analysis of animal behaviour is a requirement to understand the task solving strategies of animals and the underlying control mechanisms. The identification of repeatedly occurring behavioural components is thereby a key element of a structured quantitative description. However, the complexity of most behaviours makes the identification of such behavioural components a challenging problem. We propose an automatic and objective approach for determining and evaluating prototypical behavioural components. Behavioural prototypes are identified using clustering algorithms and finally evaluated with respect to their ability to represent the whole behavioural data set. The prototypes allow for a meaningful segmentation of behavioural sequences. We applied our clustering approach to identify prototypical movements of the head of blowflies during cruising flight. The results confirm the previously established saccadic gaze strategy by the set of prototypes being divided into either predominantly translational or rotational movements, respectively. The prototypes reveal additional details about the saccadic and intersaccadic flight sections that could not be unravelled so far. Successful application of the proposed approach to behavioural data shows its ability to automatically identify prototypical behavioural components within a large and noisy database and to evaluate these with respect to their quality and stability. Hence, this approach might be applied to a broad range of behavioural and neural data obtained from different animals and in different contexts.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Hierarchical clustering approach.
A) Artificial data of a two dimensional feature set. B) Dendrogram of applying agglomerative hierarchical clustering using Ward's criterion on the data shown in A. The x-axis indicates individual data points from A. C) Joining costs plotted against the number of clusters. D) Differential joining costs for the interesting range of number of clusters. Costs increase significantly if the algorithm groups the data in less than five clusters.
Figure 2
Figure 2. K-means clustering approach.
A) Two dimensional artificial feature vectors to be clustered. B) Solid lines divide the feature space into Voronoi cells for the random centroid starting positions. Each of the cell's centroids is denoted by an individual marker. C) Voronoi plots of the nine steps needed by a k-means algorithm to find five stable clusters. The greyness of lines and markers indicates to which step of clustering they belong. D) The final clustering is shown in black above the data in grey. E,F) Results of clustering assuming an improper numbers of clusters.
Figure 3
Figure 3. Evaluations for k-means clustering the artificial data of Figures 1 and 2.
The number of clusters and the data sets are varied. A) Instability values. B) Mean quality values. C) Mean quality of mean set for each number of clusters, respectively.
Figure 4
Figure 4. Quality values and centroid visualisation for individual clusters within the complete artificial data set.
Clusters result from determining the mean set of four and five centroids, respectively. Visualisation of the five centroids as modified star plots, see text for explanation.
Figure 5
Figure 5. Calculating fly head velocity features.
A) The fly head fixed coordinate system used for calculating three translational and three rotational velocities. B) PCA analysis of the two different sets of Calliphora head velocity data. Part of the covered data variance in dependence on the number of principal components taken into account. C) Visualisation of the principal components sorted in decreasing order of variance content, each.
Figure 6
Figure 6. Hierarchical clustering of normalised Calliphora head velocity data.
Three different data segments containing 5000 data points each were clustered using Ward's joining criterion. A) Joining costs in dependence on the remaining number of clusters. B) The deviation of the cost function for the most interesting range of fewer than 100 clusters.
Figure 7
Figure 7. Criteria for validating the k-means clustering results for the normalised Calliphora head velocity data.
Instability A) within and B) between data set configurations and C) quality of clustering results in dependence on the number of clusters for varying data sets. D) Individual quality values for mean nine centroids of the complete data set.
Figure 8
Figure 8. Mean set of nine centroids calculated based on the normalised complete data set for Calliphora free flight head velocity data.
The part of the data in percent assigned to the individual centroid is given with each centroid.
Figure 9
Figure 9. Mean set of nine velocity prototypes for Calliphora head data.
Feature values are accomplished with physical units: m/s for translational, deg/ms for rotational velocities. Note the different scales for the rotationally and translationally dominated prototypes.
Figure 10
Figure 10. Segmentation of behavioural sequences into prototypical movements.
A) Overall occurring lengths of prototypical movements for the individual prototypes. B) Velocity data and individually assigned velocity prototypes (prototype numbers as in Figure 9) for an exemplary part of a fly head trajectory. C) Segmentation of the example trajectory into prototypical movements. For better visualisation the trajectory is projected into two dimensions, just yaw rotation is shown, and the four saccadic prototypes are summarized resulting in six remaining differently coloured prototypes.

Similar articles

Cited by

References

    1. Stephens GJ, Johnson-Kerner B, Bialek W, Ryu WS. Dimensionality and dynamics in the behavior of c. elegans. PLoS Comput Biol. 2008;4:e1000028. - PMC - PubMed
    1. Kern R, van Hateren J, Michaelis C, Lindemann JP, Egelhaaf M. Function of a fly motion-sensitive neuron matches eye movements during free flight. PLoS Biology. 2005;3:e171. - PMC - PubMed
    1. Sokolowski MB. Drosophila: Genetics meets behaviour. Nature Reviews Genetics. 2001;2:879–890. - PubMed
    1. Bucan M, Abel T. The mouse: genetics meets behaviour. Nature Reviews Genetics. 2002;3:114–123. - PubMed
    1. Fentress JC, Stilwell FP. Grammar of a movement sequence in inbred mice. Nature. 1973;244:52–53. - PubMed

Publication types