Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 4:2022:baab087.
doi: 10.1093/database/baab087.

PSL-LCCL: a resource for subcellular protein localization in liver cancer cell line SK_HEP1

PSL-LCCL: a resource for subcellular protein localization in liver cancer cell line SK_HEP1

Fang Huang et al. Database (Oxford). .

Abstract

The characterization of subcellular protein localization provides a basis for further understanding cellular behaviors. A delineation of subcellular localization of proteins on cytosolic membrane-bound organelles in human liver cancer cell lines (hLCCLs) has yet to be performed. To obtain its proteome-wide view, we isolated and enriched six cytosolic membrane-bound organelles in one of the hLCCLs (SK_HEP1) and quantified their proteins using mass spectrometry. The vigorous selection of marker proteins and a machine-learning-based algorithm were implemented to localize proteins at cluster and neighborhood levels. We validated the performance of the proposed method by comparing the predicted subcellular protein localization with publicly available resources. The profiles enabled investigating the correlation of protein domains with their subcellular localization and colocalization of protein complex members. A subcellular proteome database for SK_HEP1, including (i) the subcellular protein localization and (ii) the subcellular locations of protein complex members and their interactions, was constructed. Our research provides resources for further research on hLCCLs proteomics. Database URL: http://www.igenetics.org.cn/project/PSL-LCCL/.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A graphical abstract for the present study. Six cytosolic membrane-bound organelles were isolated and enriched from SK_HEP1, and proteins in each organelle were quantified by MS. A compiled list of marker proteins was clustered and trained using a machine-learning-based algorithm. All proteins were localized at the cluster and neighborhood levels, respectively, as shown in a hierarchical structure. The localization of protein domains and complexes was further investigated. The proteome for subcellular organelles was available at www.igenetics.org.cn/PSL-LCCL.
Figure 2.
Figure 2.
The annotation of the selected 1481 marker proteins. (A) A three-dimensional visualization for 18 clusters of marker proteins. Different colors represent clusters or neighborhoods. The number of marker proteins classified in each cluster was shown in brackets. Annotation for the subcellular compartments/organelles and their corresponding neighborhoods was present. ED, endosome; ER, endoplasmic reticulum, G, golgi apparatus; LY, lysosome; PM, plasma membrane; Mito, mitochondria; Mito. M., mitochondria membrane. (B) Differential expression of the selected marker proteins in organelles. R, replicate.
Figure 3.
Figure 3.
Subcellular localization of all identified proteins (n = 4464) at the cluster level. (A) The t-SNE classification for all proteins in a three-dimensional space. (B) The consistency of the classified proteins in replicate. (C) The classification accuracy of marker proteins at the cluster level. Classified: the number of marker proteins retained at each cluster level after filtering by threshold; predicted: the number of marker proteins correctly predicted. (D) The overlap of proteins classified with single cluster and proteins annotated with a single location in GO and UniProt. (E) Evaluation of single-cluster-classified proteins against proteins with single subcellular localization annotation from GO or UniProt.
Figure 4.
Figure 4.
Subcellular localization of the identified proteins (n = 4464) at the neighborhood level. (A) A projection of all identified proteins in a three-dimensional space at the neighborhood level. (B) A hierarchical network of the classification at both cluster and neighborhood levels. (C) The consistency of the classified proteins in replicate. (D) The classification accuracy of marker proteins at the neighborhood level. Classified: the number of marker proteins retained at each neighborhood level after threshold filtering; predicted: number of marker proteins correctly predicted. (E) The overlap of single-neighborhood-classified proteins and proteins annotated with a single location in GO and UniProt. (F) Evaluation of single neighborhood classified against proteins with single subcellular localization annotation from GO or UniProt.
Figure 5.
Figure 5.
The evaluation of subcellular protein localization in PSL-LCCL against the public databases. (A) An agreement of neighborhood-based classification of proteins in PSL-LCC with proteins having unique localization annotated in UniProt or GO (upper). An agreement between PSL-LCC classifications and UniProt and GO single-location proteins of each neighborhood (below). Union, the union of UniProt and GO; Intersection, the overlap proteins in UniProt and GO. (B) An agreement of the neighborhood-based classifications in PSL-LCCL and proteins having unique localization annotated in Human Cell Atlas (upper) and their corresponding reliability score (below). (C) An agreement of the neighborhood-based classification in PSL-LCCL and proteins localized in SubCellBarCode Orre et al. (8). (D) An agreement of proteins localized in mitochondria in PSL-LCCL with MitoCarta Rath et al. (28).
Figure 6.
Figure 6.
The effects of domains on subcellular proteins localization. (A) The localization of proteins with transmembrane domain in the hierarchical structure. Enriched locations (P < 0.05) are indicated. (B). Enrichment analysis for Pfam domains in our neighborhoods. The cutoff for fold enrichment was two (*adjusted P-value* < 0.05). (C). Protein domains significantly enriched in PSL-LCCL. (D) The number of protein domains that are significantly enriched in neighborhoods. (E) The domain of the retromer complex.
Figure 7.
Figure 7.
Subcellular localization for the protein complex. (A) The coverage of CORUM complexes in PSL-LCCL. CORUM: the comprehensive resource of mammalian protein complexes; full: all the protein complex members were present in our data; no coverage: members in the protein complex were not identified in our study. (B) A cumulative plot of the correlation for the protein complex members in CORUM. Random sampling was from a random sampling of correlations for nonprotein complex members. (C) The consistency of the localization of the protein complex members. (D) Examples of members of protein complexes located in the same neighborhood. (E) Examples of members of protein complexes located in different neighborhoods.

Similar articles

Cited by

References

    1. Lundberg E. and Borner G.H. (2019) Spatial proteomics: a powerful discovery tool for cell biology. Nat. Rev. Mol. Cell Biol., 20, 285–302. - PubMed
    1. Abdrabou A., Brandwein D. and Wang Z. (2020) Differential subcellular distribution and translocation of seven 14-3-3 isoforms in response to EGF and during the cell cycle. Int. J. Mol. Sci., 21, 318. - PMC - PubMed
    1. Bauer N.C., Doetsch P.W. and Corbett A.H. (2015) Mechanisms regulating protein localization. Traffic, 16, 1039–1061. - PubMed
    1. Boisvert F.-M., Lam Y.W., Lamont D. et al. (2010) A quantitative proteomics analysis of subcellular proteome localization and changes induced by DNA damage. Mol. Cell. Proteomics, 9, 457–470. - PMC - PubMed
    1. Beltran P.M.J., Mathias R.A. and Cristea I.M. (2016) A portrait of the human organelle proteome in space and time during cytomegalovirus infection. Cell Syst., 3, 361–373. - PMC - PubMed

Publication types