Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Feb;37(2):173-83.
doi: 10.1002/gepi.21697. Epub 2012 Nov 17.

Incorporating network structure in integrative analysis of cancer prognosis data

Affiliations

Incorporating network structure in integrative analysis of cancer prognosis data

Jin Liu et al. Genet Epidemiol. 2013 Feb.

Abstract

In high-throughput cancer genomic studies, markers identified from the analysis of single datasets may have unsatisfactory properties because of low sample sizes. Integrative analysis pools and analyzes raw data from multiple studies, and can effectively increase sample size and lead to improved marker identification results. In this study, we consider the integrative analysis of multiple high-throughput cancer prognosis studies. In the existing integrative analysis studies, the interplay among genes, which can be described using the network structure, has not been effectively accounted for. In network analysis, tightly connected nodes (genes) are more likely to have related biological functions and similar regression coefficients. The goal of this study is to develop an analysis approach that can incorporate the gene network structure in integrative analysis. To this end, we adopt an AFT (accelerated failure time) model to describe survival. A weighted least squares approach, which has low computational cost, is adopted for estimation. For marker selection, we propose a new penalization approach. The proposed penalty is composed of two parts. The first part is a group MCP penalty, and conducts gene selection. The second part is a Laplacian penalty, and smoothes the differences of coefficients for tightly connected genes. A group coordinate descent approach is developed to compute the proposed estimate. Simulation study shows satisfactory performance of the proposed approach when there exist moderate-to-strong correlations among genes. We analyze three lung cancer prognosis datasets, and demonstrate that incorporating the network structure can lead to the identification of important genes and improved prediction performance.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Solution paths for a simulated dataset under scenario 1. Solid lines are for nonzero gene effects, and dotted lines are for zero effects. Solid lines with the same symbols are for the same gene.
Figure 2
Figure 2
Data analysis: frequency of canonical correlations.

Similar articles

Cited by

References

    1. Agathanggelou A, Bieche I, Ahmed-Choudhury J, Nicke B, Dammann R, Baksh S, Gao B, Minna JD, Downward J, Maher ER, Latif F. Identification of novel gene expression targets for the ras association domain family 1 (rassf1a) tumor suppressor gene in non-small cell lung cancer and neuroblastoma. Cancer Res. 2003;63(17):5344–5351. - PMC - PubMed
    1. Armstrong B, Hutchinson E, Unwin J, Fletcher T. Lung cancer risk after exposure to polycyclic aromatic hydrocarbons: A review and meta-analysis. Environ Health Perspect. 2004;112(9):970–978. - PMC - PubMed
    1. Bonner AE, Lemon WJ, You M. Gene expression signatures identify novel regulatory pathways during murine lung development: implications for lung tumorigenesis. J Med Genet. 2003;40(6):408–417. - PMC - PubMed
    1. Breheny P, Huang J. Coordinate descent algorithms for nonconvex penalized regression methods. Annals of Applied Statistics. 2011;5:232–253. - PMC - PubMed
    1. Chang X, Monitto CL, Demokan S, Kim MS, Chang SS, Zhong X, Califano JA, Sidransky D. Identification of hypermethylated genes associated with cisplatin resistance in human cancers. Cancer Res. 2010;70(7):2870–2879. - PMC - PubMed

Publication types