Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan;25(1):115-125.
doi: 10.1111/1462-2920.16238. Epub 2022 Oct 18.

Metaproteomics to understand how microbiota function: The crystal ball predicts a promising future

Affiliations

Metaproteomics to understand how microbiota function: The crystal ball predicts a promising future

Jean Armengaud. Environ Microbiol. 2023 Jan.

Abstract

In the medical, environmental, and biotechnological fields, microbial communities have attracted much attention due to their roles and numerous possible applications. The study of these communities is challenging due to their diversity and complexity. Innovative methods are needed to identify the taxonomic components of individual microbiota, their changes over time, and to determine how microoorganisms interact and function. Metaproteomics is based on the identification and quantification of proteins, and can potentially provide this full picture. Due to the wide molecular panorama and functional insights it provides, metaproteomics is gaining momentum in microbiome and holobiont research. Its full potential should be unleashed in the coming years with progress in speed and cost of analyses. In this exploratory crystal ball exercise, I discuss the technical and conceptual advances in metaproteomics that I expect to drive innovative research over the next few years in microbiology. I also debate the concepts of 'microbial dark matter' and 'Metaproteomics-Assembled Proteomes (MAPs)' and present some long-term prospects for metaproteomics in clinical diagnostics and personalized medicine, environmental monitoring, agriculture, and biotechnology.

PubMed Disclaimer

Conflict of interest statement

The author has declared no conflict of interest.

Figures

FIGURE 1
FIGURE 1
Estimation of the molecular complexity of a human faecal sample. The numbers indicated are derived from published results (Grenga et al., 2022), obtained from 39 human faecal samples analysed in triplicate. The different groups of identified organisms are indicated with the number of entities (×1 for the host) and a list of representative species per group is mentioned. The number of annotated protein‐coding sequences per group (20,383 for the host) is mentioned and their ratios are represented with the external circle. The inner circle shows the protein biomass of each group assessed experimentally by metaproteomics.
FIGURE 2
FIGURE 2
Tentative estimation of the protein and peptide spaces of a human faecal sample. The protein space is directly derived from the number of organisms estimated in Figure 1 and focussed on the most abundant organisms. Peptides are considered with equate I/L as these two residues are indistinguishable by simple mass spectrometry, with an average of 24 peptides per protein without missed‐cleavage and 70 peptides per protein when considering 1 possible missed‐cleavage. An average of 5 variants per peptide sequence is taken into account for synthesized proteins from the 200 most abundant species to obtain the number of peptides & variants.
FIGURE 3
FIGURE 3
Metaproteomics can identify and quantify uncharted branches of the tree of life. Tandem mass spectrometry‐based proteotyping of organisms present in the sample can be based on taxa‐spectrum matches (TSMs) and taxon‐specific peptides (Hardouin et al., ; Lozano et al., 2022). In this figure focused on a theoretical archaeal enriched microbial community, the presence of an uncharacterized organism belonging to the Candidatus Lokyarchaeota phylum and another belonging to the Halobacteriales order are indicated, while other organisms characterized at the genus taxonomical rank are also confirmed. These organisms are identified from taxon‐specific peptides and TSMs at the different taxonomical ranks when querying a generalist database such as NCBInr. The ratios of organisms are established based on protein biomass values derived from the TSMs parameter and are reported in percentage compared to the signal interpreted for the phylum taxonomical rank. The values presented are imaginary and serve only to explain the concept. A decrease of TSMs along the taxonomical ranks may be observed due to the phylogenetic distance between the proteins from the organisms present in the sample and those from the organisms listed in the database.
FIGURE 4
FIGURE 4
Clinical and environmental diagnostics by metaproteomics, timelines and applications. A sample‐to‐result timeline is proposed based on previously published results (Hardouin et al., 2022), along with likely optimizations over the next decade. The fields of application of metaproteomics for diagnosis or routine analysis are schematized.

References

    1. Aakko, J. , Pietila, S. , Suomi, T. , Mahmoudian, M. , Toivonen, R. , Kouvonen, P. et al. (2020) Data‐independent acquisition mass spectrometry in metaproteomics of gut microbiota‐implementation and computational analysis. Journal of Proteome Research, 19, 432–436. - PubMed
    1. Andersen, T.O. , Kunath, B.J. , Hagen, L.H. , Arntzen, M.O. & Pope, P.B. (2021) Rumen metaproteomics: closer to linking rumen microbial function to animal productivity traits. Methods, 186, 42–51. - PubMed
    1. Armengaud, J. (2009) A perfect genome annotation is within reach with the proteomics and genomics alliance. Current Opinion in Microbiology, 12, 292–300. - PubMed
    1. Armengaud, J. , Happe, B. & Timmis, K.N. (1998) Genetic analysis of dioxin dioxygenase of Sphingomonas sp. Strain RW1: catabolic genes dispersed on the genome. Journal of Bacteriology, 180, 3954–3966. - PMC - PubMed
    1. Bassignani, A. , Plancade, S. , Berland, M. , Blein‐Nicolas, M. , Guillot, A. , Chevret, D. et al. (2021) Benefits of iterative searches of large databases to interpret large human gut metaproteomic data sets. Journal of Proteome Research, 20, 1522–1534. - PubMed

Publication types