PLoS One. 2014 Aug 12;9(8):e104314. doi: 10.1371/journal.pone.0104314. eCollection 2014.

Learning a weighted meta-sample based parameter free sparse representation classification for microarray data

Bo Liao et al. PLoS One. 2014.

Abstract

Sparse representation classification (SRC) is one of the most promising classification methods for supervised learning. This method can effectively exploit discriminating information by introducing an ℓ1 regularization term to the data. With the desirable property of sparsity, SRC is robust to both noise and outliers. In this study, we propose a weighted meta-sample based non-parametric sparse representation classification method for the accurate identification of tumor subtypes. The proposed method includes three steps. First, we extract weighted meta-samples for each subclass from the raw data, and the rationality of the weighting strategy is proven mathematically. Second, sparse representation coefficients are obtained by ℓ1 regularization of underdetermined linear equations, so data-dependent sparsity can be adaptively tuned. Third, a simple characteristic function is used to achieve classification. Asymptotic time complexity analysis is applied to our method. Compared with some state-of-the-art classifiers, the proposed method has lower time complexity and more flexibility. Experiments on eight publicly available gene expression profile datasets show the effectiveness of the proposed method.
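
For a concrete picture of the classification step, the following is a minimal sketch of generic sparse representation classification in Python. It assumes the weighted meta-samples have already been extracted and stacked into one dictionary block per subclass, uses a fixed Lasso penalty (alpha) in place of the paper's parameter-free tuning, and scores classes by the common reconstruction-residual rule rather than the paper's characteristic function; the function name src_predict and all parameters are illustrative, not taken from the paper.

    # Minimal SRC sketch (illustrative; not the paper's exact PFMSRC algorithm).
    # Assumptions: meta-samples are already extracted and grouped per subclass;
    # sparsity is controlled by a fixed Lasso penalty instead of the paper's
    # parameter-free tuning; the decision rule is the usual residual criterion.
    import numpy as np
    from sklearn.linear_model import Lasso

    def src_predict(class_dictionaries, y, alpha=0.01):
        """class_dictionaries: list of (n_genes x n_meta_samples) arrays, one per subclass.
        y: test expression profile of shape (n_genes,). Returns the predicted class index."""
        D = np.hstack(class_dictionaries)          # full dictionary, columns = meta-samples
        coder = Lasso(alpha=alpha, fit_intercept=False, max_iter=10000)
        coder.fit(D, y)                            # l1-regularized coding of y over D
        x = coder.coef_
        residuals, start = [], 0
        for Dc in class_dictionaries:              # score each subclass by how well its own
            k = Dc.shape[1]                        # coefficients reconstruct the test sample
            residuals.append(np.linalg.norm(y - Dc @ x[start:start + k]))
            start += k
        return int(np.argmin(residuals))

In this sketch the sparsity level is set by a fixed alpha; the paper's contribution is precisely that such a parameter can be avoided, so the code should be read only as a baseline SRC illustration.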


Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1. Illustration of the meta-sample model: each column vector of the data matrix can be represented as a linear combination of the meta-samples, and the corresponding column of the coefficient matrix holds the combination coefficients.
Figure 2. Optimal classification accuracy of MSRC on COLON; the x-axes show the number of meta-samples (left) and the regularization parameter (right). Classification accuracy is more sensitive to the number of meta-samples than to the regularization parameter.
Figure 3. Flowchart of the PFMSRC scheme.
Figure 4. Comparison of prediction accuracy on four binary classification datasets when varying the number of samples per subclass; once this number exceeds 10, the prediction accuracy of the model-based methods decreases as it grows.
Figure 5. Comparison of prediction accuracy on four multiclass classification datasets when varying the number of samples per subclass; once this number exceeds 10, the performance degradation of the model-based methods is less pronounced than in the binary case.
Figure 6. Comparison of prediction accuracy on four binary classification datasets when varying the number of top selected genes.
Figure 7. Comparison of prediction accuracy on four multiclass classification datasets when varying the number of top selected genes.

