Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017;112(517):342-350.
doi: 10.1080/01621459.2016.1139497. Epub 2017 May 3.

Promoting Similarity of Sparsity Structures in Integrative Analysis with Penalization

Affiliations

Promoting Similarity of Sparsity Structures in Integrative Analysis with Penalization

Yuan Huang et al. J Am Stat Assoc. 2017.

Abstract

For data with high-dimensional covariates but small sample sizes, the analysis of single datasets often generates unsatisfactory results. The integrative analysis of multiple independent datasets provides an effective way of pooling information and outperforms single-dataset and several alternative multi-datasets methods. Under many scenarios, multiple datasets are expected to share common important covariates, that is, the corresponding models have similarity in their sparsity structures. However, the existing methods do not have a mechanism to promote the similarity in sparsity structures in integrative analysis. In this study, we consider penalized variable selection and estimation in integrative analysis. We develop an L0-penalty based method, which explicitly promotes the similarity in sparsity structures. Computationally it is realized using a coordinate descent algorithm. Theoretically it has the selection and estimation consistency properties. Under a wide spectrum of simulation scenarios, it has identification and estimation performance comparable to or better than the alternatives. In the analysis of three lung cancer datasets with gene expression measurements, it identifies genes with sound biological implications and satisfactory prediction performance.

Keywords: L0 penalization; cancer genomic data; integrative analysis; sparsity structure; variable selection.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Analysis scheme: shaded rectangles represent the true and identified important covariates. Right: true sparsity structures. Middle: sparsity structures identified by composite MCP. Right: sparsity structures identified by the proposed method.

Similar articles

Cited by

References

    1. Argyriou A, Evgeniou T, Pontil M. Convex multi-task feature learning. Machine Learning. 2008;73(3):243–272.
    1. Breheny P, Huang J. Penalized methods for bi-level variable selection. Statistics and its Interface. 2009;2(3):369–380. - PMC - PubMed
    1. Dicker L, Huang B, Lin X. Variable selection and estimation with the seamless-l0 penalty. Statistica Sinica. 2012;23:929–962.
    1. Fan J, Lv J. Nonconcave penalized likelihood with NP-dimensionality. IEEE Transactions on Information Theory. 2011;57:5467–5484. - PMC - PubMed
    1. Fan J, Peng H. Nonconcave penalized likelihood with a diverging number of parameters. The Annals of Statistics. 2004;32:928–961.