Analysis of cancer gene expression data with an assisted robust marker identification approach
- PMID: 28913902
- PMCID: PMC5912176
- DOI: 10.1002/gepi.22066
Analysis of cancer gene expression data with an assisted robust marker identification approach
Abstract
Gene expression (GE) studies have been playing a critical role in cancer research. Despite tremendous effort, the analysis results are still often unsatisfactory, because of the weak signals and high data dimensionality. Analysis is often further challenged by the long-tailed distributions of the outcome variables. In recent multidimensional studies, data have been collected on GEs as well as their regulators (e.g., copy number alterations (CNAs), methylation, and microRNAs), which can provide additional information on the associations between GEs and cancer outcomes. In this study, we develop an ARMI (assisted robust marker identification) approach for analyzing cancer studies with measurements on GEs as well as regulators. The proposed approach borrows information from regulators and can be more effective than analyzing GE data alone. A robust objective function is adopted to accommodate long-tailed distributions. Marker identification is effectively realized using penalization. The proposed approach has an intuitive formulation and is computationally much affordable. Simulation shows its satisfactory performance under a variety of settings. TCGA (The Cancer Genome Atlas) data on melanoma and lung cancer are analyzed, which leads to biologically plausible marker identification and superior prediction.
Keywords: assisted analysis; cancer; gene expression; robustness.
© 2017 WILEY PERIODICALS, INC.
Figures
Similar articles
-
Inferring gene regulatory relationships with a high-dimensional robust approach.Genet Epidemiol. 2017 Jul;41(5):437-454. doi: 10.1002/gepi.22047. Epub 2017 May 2. Genet Epidemiol. 2017. PMID: 28464328 Free PMC article.
-
Deciphering the associations between gene expression and copy number alteration using a sparse double Laplacian shrinkage approach.Bioinformatics. 2015 Dec 15;31(24):3977-83. doi: 10.1093/bioinformatics/btv518. Epub 2015 Sep 3. Bioinformatics. 2015. PMID: 26342102 Free PMC article.
-
Robust semiparametric gene-environment interaction analysis using sparse boosting.Stat Med. 2019 Oct 15;38(23):4625-4641. doi: 10.1002/sim.8322. Epub 2019 Jul 29. Stat Med. 2019. PMID: 31359454 Free PMC article.
-
[Molecular classification and markers of malignant melanoma].Magy Onkol. 2013 Jun;57(2):73-8. Epub 2013 May 20. Magy Onkol. 2013. PMID: 23795351 Review. Hungarian.
-
Emerging Biomarkers in Cutaneous Melanoma.Mol Diagn Ther. 2018 Apr;22(2):203-218. doi: 10.1007/s40291-018-0318-z. Mol Diagn Ther. 2018. PMID: 29411301 Review.
Cited by
-
Integration strategies of multi-omics data for machine learning analysis.Comput Struct Biotechnol J. 2021 Jun 22;19:3735-3746. doi: 10.1016/j.csbj.2021.06.030. eCollection 2021. Comput Struct Biotechnol J. 2021. PMID: 34285775 Free PMC article. Review.
-
Hierarchical Ridge Regression for Incorporating Prior Information in Genomic Studies.J Data Sci. 2022 Jan;20(1):34-50. doi: 10.6339/21-jds1030. Epub 2021 Dec 13. J Data Sci. 2022. PMID: 36274755 Free PMC article.
-
Assisted gene expression-based clustering with AWNCut.Stat Med. 2018 Dec 20;37(29):4386-4403. doi: 10.1002/sim.7928. Epub 2018 Aug 9. Stat Med. 2018. PMID: 30094873 Free PMC article.
-
Integrative functional linear model for genome-wide association studies with multiple traits.Biostatistics. 2022 Apr 13;23(2):574-590. doi: 10.1093/biostatistics/kxaa043. Biostatistics. 2022. PMID: 33040145 Free PMC article.
-
A Selective Review of Multi-Level Omics Data Integration Using Variable Selection.High Throughput. 2019 Jan 18;8(1):4. doi: 10.3390/ht8010004. High Throughput. 2019. PMID: 30669303 Free PMC article. Review.
References
-
- Aggarwal CC, Hinneburg A, Keim DA. Lecture Notes in Computer Science. Springer; 2001. On the surprising behavior of distance metrics in high dimensional space; pp. 420–434.
-
- Bowman L. Doctors, researchers worry about accuracy of social security “death file”. 2011 www.dailyrepublic.com/usworld/doctors-researchers-worry-about-accuracy-o...
-
- Fall K, Stromberg F, Rosell J, Andren O, E V, Group, S.-E. R. P. C. Reliability of death certificates in prostate cancer patients. Scand J Urol Nephrol. 2008;42:352–357. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources