Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Sep 14;13(9):1318.
doi: 10.3390/brainsci13091318.

Data Mining of Microarray Datasets in Translational Neuroscience

Affiliations
Review

Data Mining of Microarray Datasets in Translational Neuroscience

Lance M O'Connor et al. Brain Sci. .

Abstract

Data mining involves the computational analysis of a plethora of publicly available datasets to generate new hypotheses that can be further validated by experiments for the improved understanding of the pathogenesis of neurodegenerative diseases. Although the number of sequencing datasets is on the rise, microarray analysis conducted on diverse biological samples represent a large collection of datasets with multiple web-based programs that enable efficient and convenient data analysis. In this review, we first discuss the selection of biological samples associated with neurological disorders, and the possibility of a combination of datasets, from various types of samples, to conduct an integrated analysis in order to achieve a holistic understanding of the alterations in the examined biological system. We then summarize key approaches and studies that have made use of the data mining of microarray datasets to obtain insights into translational neuroscience applications, including biomarker discovery, therapeutic development, and the elucidation of the pathogenic mechanisms of neurodegenerative diseases. We further discuss the gap to be bridged between microarray and sequencing studies to improve the utilization and combination of different types of datasets, together with experimental validation, for more comprehensive analyses. We conclude by providing future perspectives on integrating multi-omics, to advance precision phenotyping and personalized medicine for neurodegenerative diseases.

Keywords: biological samples; biomarker discovery; circular RNA (circRNA); long non-coding RNA (lncRNA); messenger RNA (mRNA); microRNA (miRNA); microarray analysis; multi-omics integration; therapeutic development; translational neuroscience.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Data mining of different types of RNA in various biological samples for translational neuroscience applications. (A) Various types of biological samples, including post-mortem brain tissues, CSF, peripheral blood, and human stem cells. (B) Different types of coding (mRNA) and non-coding RNA (miRNA, circRNA, and lncRNA) obtained from biological samples. (C) Data mining of microarray datasets associated with neurodegenerative diseases. Different genes detected by the microarray analysis are illustrated by different colors. (D) Translational neuroscience applications including drug discovery, the elucidation of disease mechanisms, and biomarker identification. The figure was created using BioRender.
Figure 2
Figure 2
Pipeline for the data mining of microarray gene expression. (A) Searching for suitable microarray datasets to be analyzed using web-based tools or command line scripts. Different genes identified by microarray analysis are represented by different colors. (B) Pre-processing of datasets via normalization, quality control, and feature selection. (C) Statistical tests to obtain DEGs with corresponding P-values (significance) and LogFC (fold-change). Downregulated genes are illustrated in red and upregulated genes are illustrated in purple. (D) Enrichment analysis and visualization using pathway analysis, gene set enrichment analysis (GSEA), and network analysis to provide a biological interpretation of the DEGs. Different pathways or functional annotations of the DEGs are illustrated by different colors. The figure was created using BioRender.

Similar articles

Cited by

References

    1. Negi A., Shukla A., Jaiswar A., Shrinet J., Jasrotia R.S. In: Chapter 6-Applications and Challenges of Microarray and RNA-Sequencing. Singh D.B., Pathak R.K.B.T.-B., editors. Academic Press; Cambridge, MA, USA: 2022. pp. 91–103.
    1. Costa C., Giménez-Capitán A., Karachaliou N., Rosell R. Comprehensive Molecular Screening: From the RT-PCR to the RNA-Seq. Transl. lung cancer Res. 2013;2:87–91. - PMC - PubMed
    1. Kukurba K.R., Montgomery S.B. RNA Sequencing and Analysis. Cold Spring Harb. Protoc. 2015;2015:951–969. doi: 10.1101/pdb.top084970. - DOI - PMC - PubMed
    1. Wu A.R., Neff N.F., Kalisky T., Dalerba P., Treutlein B., Rothenberg M.E., Mburu F.M., Mantalas G.L., Sim S., Clarke M.F., et al. Quantitative Assessment of Single-Cell RNA-Sequencing Methods. Nat. Methods. 2014;11:41–46. doi: 10.1038/nmeth.2694. - DOI - PMC - PubMed
    1. Mantione K.J., Kream R.M., Kuzelova H., Ptacek R., Raboch J., Samuel J.M., Stefano G.B. Comparing Bioinformatic Gene Expression Profiling Methods: Microarray and RNA-Seq. Med. Sci. Monit. Basic Res. 2014;20:138–142. - PMC - PubMed

LinkOut - more resources