Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 May 13:6:117.
doi: 10.1186/1471-2105-6-117.

Robust detection of periodic time series measured from biological systems

Affiliations

Robust detection of periodic time series measured from biological systems

Miika Ahdesmäki et al. BMC Bioinformatics. .

Abstract

Background: Periodic phenomena are widespread in biology. The problem of finding periodicity in biological time series can be viewed as a multiple hypothesis testing of the spectral content of a given time series. The exact noise characteristics are unknown in many bioinformatics applications. Furthermore, the observed time series can exhibit other non-idealities, such as outliers, short length and distortion from the original wave form. Hence, the computational methods should preferably be robust against such anomalies in the data.

Results: We propose a general-purpose robust testing procedure for finding periodic sequences in multiple time series data. The proposed method is based on a robust spectral estimator which is incorporated into the hypothesis testing framework using a so-called g-statistic together with correction for multiple testing. This results in a robust testing procedure which is insensitive to heavy contamination of outliers, missing-values, short time series, nonlinear distortions, and is completely insensitive to any monotone nonlinear distortions. The performance of the methods is evaluated by performing extensive simulations. In addition, we compare the proposed method with another recent statistical signal detection estimator that uses Fisher's test, based on the Gaussian noise assumption. The results demonstrate that the proposed robust method provides remarkably better robustness properties. Moreover, the performance of the proposed method is preferable also in the standard Gaussian case. We validate the performance of the proposed method on real data on which the method performs very favorably.

Conclusion: As the time series measured from biological systems are usually short and prone to contain different kinds of non-idealities, we are very optimistic about the multitude of possible applications for our proposed robust statistical periodicity detection method.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Examples of time series. An example of a time series composed of a sine and (a) additive standard Gaussian noise, (b) additive standard Gaussian and impulsive noise, and (c) additive standard Gaussian noise and cubic distortion.
Figure 2
Figure 2
Examples of spectral estimates. The spectral estimates for the time series in Figures 1 (a)-(c), respectively, using both the standard periodogram and the proposed robust method.
Figure 3
Figure 3
Power of the test. The power of the tests (y-axis) for the three different test cases as the function of the time series length and varying noise parameters (x-axis). The solid (resp. dashed) line corresponds to the proposed robust method (resp. Fisher's test). Three different types of non-idealities are considered, namely, pure standard Gaussian noise (the first row), standard Gaussian and impulsive noise (the second row), and standard Gaussian noise and x3 distortion (the third row). The left (resp. right) column shows the results for different time series lengths (resp. different values of the noise parameters).
Figure 4
Figure 4
Benchmark results. The fraction of the benchmark set that is identified (y-axis) as the function of the highest ranked genes (x-axis). The solid (resp. dashed) line corresponds the robust detection having fixed (resp. unknown) frequency. The dotted line shows the performance of the random gene selection. The columns from left to right correspond to the Alpha, the Cdcl5 and the Cdc28 experiment by [6]. The rows, from top to bottom, correspond to the three different benchmark gene sets B1, B2 and B3. for more details about the benchmark gene sets, see [24].

References

    1. Tyson JJ. Biochemical oscillations. In: Fall C, Marland E, Wagner J, Tyson J, editor. Computational Cell Biology: An Introductory Text on Computer Modeling in Molecular and Cell Biology. New York: Springer-Verlag; 2002.
    1. Breeden LL. Periodic transcription: a cycle within a cycle. Curr Biol. 2003;13:R31–R38. doi: 10.1016/S0960-9822(02)01386-6. - DOI - PubMed
    1. Correa A, Lewis ZA, Greene AV, March IJ, Gomer RH, Bell-Pedersen D. Multiple oscillators regulate circadian gene expression in Neurospora. Proc Natl Acad Sci USA. 2003;100:13597–13602. doi: 10.1073/pnas.2233734100. - DOI - PMC - PubMed
    1. Sherr CJ. Cancer cell cycles. Science. 1996;274:1672–1677. doi: 10.1126/science.274.5293.1672. - DOI - PubMed
    1. Whitfield ML, Sherlock G, Saldanha AJ, Murray JI, Ball CA, Alexander KE, Matese JC, Perou CM, Hurt MM, Brown PO, Botstein D. Identification of genes periodically expressed in the human cell cycle and their expression in tumors. Mol Biol Cell. 2002;13:1977–2000. doi: 10.1091/mbc.02-02-0030.. - DOI - PMC - PubMed

Publication types