Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 5;52(W1):W215-W220.
doi: 10.1093/nar/gkae237.

DeepLoc 2.1: multi-label membrane protein type prediction using protein language models

Affiliations

DeepLoc 2.1: multi-label membrane protein type prediction using protein language models

Marius Thrane Ødum et al. Nucleic Acids Res. .

Abstract

DeepLoc 2.0 is a popular web server for the prediction of protein subcellular localization and sorting signals. Here, we introduce DeepLoc 2.1, which additionally classifies the input proteins into the membrane protein types Transmembrane, Peripheral, Lipid-anchored and Soluble. Leveraging pre-trained transformer-based protein language models, the server utilizes a three-stage architecture for sequence-based, multi-label predictions. Comparative evaluations with other established tools on a test set of 4933 eukaryotic protein sequences, constructed following stringent homology partitioning, demonstrate state-of-the-art performance. Notably, DeepLoc 2.1 outperforms existing models, with the larger ProtT5 model exhibiting a marginal advantage over the ESM-1B model. The web server is available at https://services.healthtech.dtu.dk/services/DeepLoc-2.1.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
An example display of the predictions from the web server. All results from the tables can be downloaded as a comma-separated file (CSV) at the top of the page, which includes the predictions for the subcellular localization, membrane protein type and sorting signals. The attention plot and attention values can be downloaded separately. The predicted subcellular localization, membrane protein type and sorting signal labels are listed, along with prediction score tables. The predicted locations and membrane protein types in the tables are highlighted in green, with the intensity of the color indicating the certainty of the prediction. If none of the scores surpasses the threshold, the label closest to its threshold is selected. Elevated values in the logo-like plot indicate important regions in the sequence for the subcellular localization prediction, potentially corresponding to sorting signals. This is intended as a general guideline, and for a more in-depth and precise analysis of these signals, specialized tools like SignalP (14), TargetP (15) or NetGPI (16) can be employed.

Similar articles

Cited by

References

    1. Alberts B., Johnson A., Lewis J., Raff M., Roberts K., Walter P. Membrane proteins. Molecular Biology of the Cell. 4th edn. 2002; Garland Science.
    1. Yin H., Flynn A.D. Drugging membrane protein interactions. Annu. Rev. Biom. Eng. 2016; 18:51–76. - PMC - PubMed
    1. Overington J.P., Al-Lazikani B., Hopkins A.L. How many drug targets are there. Nat. Rev. Drug Discov. 2006; 5:993–996. - PubMed
    1. Thumuluri V., Almagro Armenteros J.J., Johansen A.R., Nielsen H., Winther O. DeepLoc 2.0: multi-label subcellular localization prediction using protein language models. Nucleic Acids Res. 2022; 50:W228–W234. - PMC - PubMed
    1. Almagro Armenteros J.J., Sønderby C.K., Sønderby S.K., Nielsen H., Winther O. DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics. 2017; 33:3387–3395. - PubMed

LinkOut - more resources