Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 7;23(22):9014.
doi: 10.3390/s23229014.

Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence

Affiliations

Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence

Kai Zou et al. Sensors (Basel). .

Abstract

Protein is one of the primary biochemical macromolecular regulators in the compartmental cellular structure, and the subcellular locations of proteins can therefore provide information on the function of subcellular structures and physiological environments. Recently, data-driven systems have been developed to predict the subcellular location of proteins based on protein sequence, immunohistochemistry (IHC) images, or immunofluorescence (IF) images. However, the research on the fusion of multiple protein signals has received little attention. In this study, we developed a dual-signal computational protocol by incorporating IHC images into protein sequences to learn protein subcellular localization. Three major steps can be summarized as follows in this protocol: first, a benchmark database that includes 281 proteins sorted out from 4722 proteins of the Human Protein Atlas (HPA) and Swiss-Prot database, which is involved in the endoplasmic reticulum (ER), Golgi apparatus, cytosol, and nucleoplasm; second, discriminative feature operators were first employed to quantitate protein image-sequence samples that include IHC images and protein sequence; finally, the feature subspace of different protein signals is absorbed to construct multiple sub-classifiers via dimensionality reduction and binary relevance (BR), and multiple confidence derived from multiple sub-classifiers is adopted to decide subcellular location by the centralized voting mechanism at the decision layer. The experimental results indicated that the dual-signal model embedded IHC images and protein sequences outperformed the single-signal models with accuracy, precision, and recall of 75.41%, 80.38%, and 74.38%, respectively. It is enlightening for further research on protein subcellular location prediction under multi-signal fusion of protein.

Keywords: benchmark database; discriminative feature operators; dual signal; protein subcellular location prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The flowchart of dual-signal model. Abbreviation definitions: ER: endoplasmic reticulum; Golgi: Golgi apparatus; Cyto.: cytosol; Nucl.: nucleoplasm.
Figure 2
Figure 2
Experimental results for different shallow features of protein sequence.
Figure 3
Figure 3
Performance of shallow features fusion of protein sequences.
Figure 4
Figure 4
Experimental results of 10 dbs Daubechies filters in protein channel patch images. (a) Line plot of Haralick feature performance fluctuations in different Daubechies filters; (b) experimental results of Haralick and LBP concatenating.
Figure 5
Figure 5
Performance comparison of IHC image in the abstract and concatenate features.
Figure 6
Figure 6
The multi-classifier outputs result and visualization by centralized voting mechanism. (a) Multi-classifier ensemble with five-fold cross-validation; (b) performance fluctuations of different protein subcellular locations in the five-fold cross-validation; (cg) visualization of confusion matrix in four categories.
Figure 7
Figure 7
The experimental results of different feature space based on Leave-One-Out.

Similar articles

Cited by

References

    1. Marx V. Mapping proteins with spatial proteomics. Nat. Methods. 2015;12:815–819. doi: 10.1038/nmeth.3555. - DOI - PubMed
    1. Christopher J.A., Geladaki A., Dawson C.S., Vennard O.L., Lilley K.S. Subcellular Transcriptomics and Proteomics: A Comparative Methods Review. Mol. Cell. Proteom. 2022;21:100186. doi: 10.1016/j.mcpro.2021.100186. - DOI - PMC - PubMed
    1. Berger C.M., Gaume X., Bouvet P. The roles of nucleolin subcellular localization in cancer. Biochimie. 2015;113:78–85. doi: 10.1016/j.biochi.2015.03.023. - DOI - PubMed
    1. Hung M.-C., Link W. Protein localization in disease and therapy. J. Cell Sci. 2011;124:3381–3392. doi: 10.1242/jcs.089110. - DOI - PubMed
    1. Zhang T., Ding Y., Chou K.-C. Prediction of protein subcellular location using hydrophobic patterns of amino acid sequence. Comput. Biol. Chem. 2006;30:367–371. doi: 10.1016/j.compbiolchem.2006.08.003. - DOI - PubMed

LinkOut - more resources