Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 9;21(1):398.
doi: 10.1186/s12859-020-03731-y.

Automated classification of protein subcellular localization in immunohistochemistry images to reveal biomarkers in colon cancer

Affiliations

Automated classification of protein subcellular localization in immunohistochemistry images to reveal biomarkers in colon cancer

Zhen-Zhen Xue et al. BMC Bioinformatics. .

Abstract

Background: Protein biomarkers play important roles in cancer diagnosis. Many efforts have been made on measuring abnormal expression intensity in biological samples to identity cancer types and stages. However, the change of subcellular location of proteins, which is also critical for understanding and detecting diseases, has been rarely studied.

Results: In this work, we developed a machine learning model to classify protein subcellular locations based on immunohistochemistry images of human colon tissues, and validated the ability of the model to detect subcellular location changes of biomarker proteins related to colon cancer. The model uses representative image patches as inputs, and integrates feature engineering and deep learning methods. It achieves 92.69% accuracy in classification of new proteins. Two validation datasets of colon cancer biomarkers derived from published literatures and the human protein atlas database respectively are employed. It turns out that 81.82 and 65.66% of the biomarker proteins can be identified to change locations.

Conclusions: Our results demonstrate that using image patches and combining predefined and deep features can improve the performance of protein subcellular localization, and our model can effectively detect biomarkers based on protein subcellular translocations. This study is anticipated to be useful in annotating unknown subcellular localization for proteins and discovering new potential location biomarkers.

Keywords: Bioimage processing; Bioinformatics; Cancer biomarkers; Machine learning; Protein subcellular location.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Framework of the experiments in this paper. a Training classifier models using IHC images. b Identifying location biomarker proteins using integrated model
Fig. 2
Fig. 2
Classification results of using whole images with different image separation methods and features
Fig. 3
Fig. 3
Classification results of using image patches. a Results of using different combinations of patch parameters. b Comparison of results between using whole images and using patches of 75*75 pixels. Features of db4 were used here
Fig. 4
Fig. 4
Examples of image patches of proteins in the three subcellular location classes. The patches in the left three columns have 75*75 pixels, while in the right three columns have 224*224 pixels
Fig. 5
Fig. 5
Classification results of SVM models and CNN models
Fig. 6
Fig. 6
Comparison of intensity distance distribution between detected and undetected biomarkers in the HPA biomarker dataset

Similar articles

Cited by

References

    1. Breker M, Schuldiner M. The emergence of proteome-wide technologies: systematic analysis of proteins comes of age. Nat Rev Mol Cell Biol. 2014;15(7):453–464. - PubMed
    1. Hung MC, Link W. Protein localization in disease and therapy. J Cell Sci. 2011;124(20):3381–3392. - PubMed
    1. Casanova I, Parreno M, Farre L, Guerrero S, Cespedes MV, Pavon MA, et al. Celecoxib induces anoikis in human colon carcinoma cells associated with the deregulation of focal adhesions and nuclear translocation of p130Cas. Int J Cancer. 2006;118(10):2381–2389. - PubMed
    1. Lin Y-Y, Hsu Y-H, Huang H-Y, Shann Y-J, Huang C-YF, Wei S-C, et al. Aberrant nuclear localization of EBP50 promotes colorectal carcinogenesis in xenotransplanted mice by modulating TCF-1 and β-catenin interactions. J Clin Invest. 2012;122(5):1881–1894. - PMC - PubMed
    1. Huang Y, Wang G, Liang Z, Yang Y, Cui L, Liu CY. Loss of nuclear localization of TET2 in colorectal cancer. Clin Epigenetics. 2016;8(1):9–9. - PMC - PubMed

LinkOut - more resources