Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 1;21(3):610-624.
doi: 10.1093/biostatistics/kxy081.

Bayesian generalized biclustering analysis via adaptive structured shrinkage

Affiliations

Bayesian generalized biclustering analysis via adaptive structured shrinkage

Ziyi Li et al. Biostatistics. .

Abstract

Biclustering techniques can identify local patterns of a data matrix by clustering feature space and sample space at the same time. Various biclustering methods have been proposed and successfully applied to analysis of gene expression data. While existing biclustering methods have many desirable features, most of them are developed for continuous data and few of them can efficiently handle -omics data of various types, for example, binomial data as in single nucleotide polymorphism data or negative binomial data as in RNA-seq data. In addition, none of existing methods can utilize biological information such as those from functional genomics or proteomics. Recent work has shown that incorporating biological information can improve variable selection and prediction performance in analyses such as linear regression and multivariate analysis. In this article, we propose a novel Bayesian biclustering method that can handle multiple data types including Gaussian, Binomial, and Negative Binomial. In addition, our method uses a Bayesian adaptive structured shrinkage prior that enables feature selection guided by existing biological information. Our simulation studies and application to multi-omics datasets demonstrate robust and superior performance of the proposed method, compared to other existing biclustering methods.

Keywords: -omics data; Adaptive shrinkage prior; Bayesian; Biclustering; Biological information; Integrative analysis.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Ben-Dor A., Chor B., Karp R. and Yakhini Z. (2003). Discovering local structure in gene expression data: the order-preserving submatrix problem. Journal of Computational Biology 10, 373–384. - PubMed
    1. Bergmann S., Ihmels J. and Barkai N. (2003). Iterative signature algorithm for the analysis of large-scale gene expression data. Physical Review E 67, 031902. - PubMed
    1. Caldas J. and Kaski S. (2008). Bayesian biclustering with the plaid model. In: IEEE Workshop on Machine Learning for Signal Processing, 2008. MLSP 2008. Cancun, Mexico:IEEE, pp. 291–296.
    1. Callister S. J., Barry R. C., Adkins J. N., Johnson E. T., Qian W., Webb-Robertson B.-J. M., Smith R. D. and Lipton M. S. (2006). Normalization approaches for removing systematic biases associated with mass spectrometry and label-free proteomics. Journal of Proteome Research 5, 277–286. - PMC - PubMed
    1. Chang C., Kundu S. and Long Q. (2018). Scalable Bayesian variable selection for structured high-dimensional data. Biometrics. doi: 10.1111/biom.12882. - DOI - PMC - PubMed

Publication types