Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jun 15;32(12):i8-i17.
doi: 10.1093/bioinformatics/btw243.

What time is it? Deep learning approaches for circadian rhythms

Affiliations

What time is it? Deep learning approaches for circadian rhythms

Forest Agostinelli et al. Bioinformatics. .

Erratum in

Abstract

Motivation: Circadian rhythms date back to the origins of life, are found in virtually every species and every cell, and play fundamental roles in functions ranging from metabolism to cognition. Modern high-throughput technologies allow the measurement of concentrations of transcripts, metabolites and other species along the circadian cycle creating novel computational challenges and opportunities, including the problems of inferring whether a given species oscillate in circadian fashion or not, and inferring the time at which a set of measurements was taken.

Results: We first curate several large synthetic and biological time series datasets containing labels for both periodic and aperiodic signals. We then use deep learning methods to develop and train BIO_CYCLE, a system to robustly estimate which signals are periodic in high-throughput circadian experiments, producing estimates of amplitudes, periods, phases, as well as several statistical significance measures. Using the curated data, BIO_CYCLE is compared to other approaches and shown to achieve state-of-the-art performance across multiple metrics. We then use deep learning methods to develop and train BIO_CLOCK to robustly estimate the time at which a particular single-time-point transcriptomic experiment was carried. In most cases, BIO_CLOCK can reliably predict time, within approximately 1 h, using the expression levels of only a small number of core clock genes. BIO_CLOCK is shown to work reasonably well across tissue types, and often with only small degradation across conditions. BIO_CLOCK is used to annotate most mouse experiments found in the GEO database with an inferred time stamp.

Availability and implementation: All data and software are publicly available on the CircadiOmics web portal: circadiomics.igb.uci.edu/

Contacts: fagostin@uci.edu or pfbaldi@uci.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Core clock genes and proteins and the corresponding transcription/translation negative feedback loop
Fig. 2.
Fig. 2.
Samples of synthetic signals in the BioCycleForm dataset. Signals in green are periodic; signals in red are aperiodic
Fig. 3.
Fig. 3.
Samples of synthetic signals in the BioCycleGauss dataset. Signals in green are periodic; signals in red are aperiodic
Fig. 4.
Fig. 4.
Samples of biological time series in the BioCycleReal dataset. Signals in green are periodic; signals in red are aperiodic. [Note the signals are spline-smoothed.]
Fig. 5.
Fig. 5.
Visualizations of the deep neural networks (DNNs)
Fig. 6.
Fig. 6.
ROC Curves of different methods on the BioCycleForm dataset
Fig. 7.
Fig. 7.
AUC at various signal-to-noise ratios (SNRs) on the BioCycleForm dataset. The lower the SNR the noisier the signal is
Fig. 8.
Fig. 8.
Accuracy of periodic/aperiodic classification at different p-value cutoffs on the BioCycleForm dataset
Fig. 9.
Fig. 9.
AUC at different levels of missing data

References

    1. Allison D.B. et al. (2002) A mixture model approach for the analysis of microarray gene expression data. Comput. Stat. Data Anal., 39, 1–20.
    1. Andrews J.L. et al. (2010) Clock and bmal1 regulate myod and are necessary for maintenance of skeletal muscle phenotype and function. Proc. Natl. Acad. Sci. USA, 107, 19090–19095. - PMC - PubMed
    1. Antunes L.C. et al. (2010) Obesity and shift work: chronobiological aspects. Nutr. Res. Rev., 23, 155–168. - PubMed
    1. Baldi P. (2012) Autoencoders, Unsupervised Learning, and Deep Architectures. J. Mach. Learn. Res., 27, 37–50 (Proceedings of 2011 ICML Workshop on Unsupervised and Transfer Learning).
    1. Baldi P., Sadowski P. (2014) The dropout learning algorithm. Artif. Intell., 210C, 78–122. - PMC - PubMed