Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;14 Suppl 8(Suppl 8):S3.
doi: 10.1186/1471-2105-14-S8-S3. Epub 2013 May 9.

Robust PCA based method for discovering differentially expressed genes

Affiliations

Robust PCA based method for discovering differentially expressed genes

Jin-Xing Liu et al. BMC Bioinformatics. 2013.

Abstract

How to identify a set of genes that are relevant to a key biological process is an important issue in current molecular biology. In this paper, we propose a novel method to discover differentially expressed genes based on robust principal component analysis (RPCA). In our method, we treat the differentially and non-differentially expressed genes as perturbation signals S and low-rank matrix A, respectively. Perturbation signals S can be recovered from the gene expression data by using RPCA. To discover the differentially expressed genes associated with special biological progresses or functions, the scheme is given as follows. Firstly, the matrix D of expression data is decomposed into two adding matrices A and S by using RPCA. Secondly, the differentially expressed genes are identified based on matrix S. Finally, the differentially expressed genes are evaluated by the tools based on Gene Ontology. A larger number of experiments on hypothetical and real gene expression data are also provided and the experimental results show that our method is efficient and effective.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The RPCA model of microarray data. The white and yellow blocks denote zero and near-zero in this figure. Red and blue blocks denote the perturbation signals.
Figure 2
Figure 2
The recognition accuracy of matrix  S with different  c. s1 denotes the recognition accuracy series with rank=5 and μ=0.05. s2 denotes the recognition accuracy series with rank=5 and μ=0.1. s3 denotes the recognition accuracy series with rank=10 and μ=0.05. s4 denotes the recognition accuracy series with rank=10 and μ=0.1.
Figure 3
Figure 3
The sample frequency of response to abiotic stimulus.

Similar articles

Cited by

References

    1. Wang B, Wong H, Huang DS. Inferring protein-protein interacting sites using residue conservation and evolutionary information. Protein and peptide letters. 2006;13(10):999. doi: 10.2174/092986606778777498. - DOI - PubMed
    1. Huang DS, Zhao XM, Huang GB, Cheung YM. Classifying protein sequences using hydropathy blocks. Pattern recognition. 2006;39(12):2293–2300. doi: 10.1016/j.patcog.2005.11.012. - DOI
    1. Wang L, Li PCH. Microfluidic DNA microarray analysis: A review. Analytica chimica acta. 2011;687(1):12–27. doi: 10.1016/j.aca.2010.11.056. - DOI - PubMed
    1. Liao JC, Boscolo R, Yang YL, Tran LM, Sabatti C, Roychowdhury VP. Network component analysis: reconstruction of regulatory signals in biological systems. Proceedings of the National Academy of Sciences. 2003;100(26):15522–15527. doi: 10.1073/pnas.2136632100. - DOI - PMC - PubMed
    1. Dueck D, Morris QD, Frey BJ. Multi-way clustering of microarray data using probabilistic sparse matrix factorization. Bioinformatics. 2005;21(suppl 1):i144–i151. doi: 10.1093/bioinformatics/bti1041. - DOI - PubMed

Publication types

MeSH terms