A Parallelization Strategy for the Time Efficient Analysis of Thousands of LC/MS Runs in High-Performance Computing Environment
- PMID: 36201825
- PMCID: PMC9930095
- DOI: 10.1021/acs.jproteome.2c00278
A Parallelization Strategy for the Time Efficient Analysis of Thousands of LC/MS Runs in High-Performance Computing Environment
Abstract
Combining robust proteomics instrumentation with high-throughput enabling liquid chromatography (LC) systems (e.g., timsTOF Pro and the Evosep One system, respectively) enabled mapping the proteomes of 1000s of samples. Fragpipe is one of the few computational protein identification and quantification frameworks that allows for the time-efficient analysis of such large data sets. However, it requires large amounts of computational power and data storage space that leave even state-of-the-art workstations underpowered when it comes to the analysis of proteomics data sets with 1000s of LC mass spectrometry runs. To address this issue, we developed and optimized a Fragpipe-based analysis strategy for a high-performance computing environment and analyzed 3348 plasma samples (6.4 TB) that were longitudinally collected from hospitalized COVID-19 patients under the auspice of the Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC) study. Our parallelization strategy reduced the total runtime by ∼90% from 116 (theoretical) days to just 9 days in the high-performance computing environment. All code is open-source and can be deployed in any Simple Linux Utility for Resource Management (SLURM) high-performance computing environment, enabling the analysis of large-scale high-throughput proteomics studies.
Keywords: Fragpipe; HPC; SLURM; parallelization; proteomics; timsTOF.
Conflict of interest statement
Conflicts of interest
N/A
Figures


Similar articles
-
dia-PASEF data analysis using FragPipe and DIA-NN for deep proteomics of low sample amounts.Nat Commun. 2022 Jul 8;13(1):3944. doi: 10.1038/s41467-022-31492-0. Nat Commun. 2022. PMID: 35803928 Free PMC article.
-
MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry.Rapid Commun Mass Spectrom. 2015 Oct 15;29(19):1841-8. doi: 10.1002/rcm.7272. Rapid Commun Mass Spectrom. 2015. PMID: 26331936 Free PMC article.
-
Zwitter-ionic monolith-based spintip column coupled with Evosep One liquid chromatography for high-throughput proteomic analysis.J Chromatogr A. 2022 Jul 19;1675:463122. doi: 10.1016/j.chroma.2022.463122. Epub 2022 May 13. J Chromatogr A. 2022. PMID: 35623190
-
On the potential of micro-flow LC-MS/MS in proteomics.Expert Rev Proteomics. 2022 Mar;19(3):153-164. doi: 10.1080/14789450.2022.2134780. Epub 2022 Oct 18. Expert Rev Proteomics. 2022. PMID: 36221222 Review.
-
Advances and challenges in liquid chromatography-mass spectrometry-based proteomics profiling for clinical applications.Mol Cell Proteomics. 2006 Oct;5(10):1727-44. doi: 10.1074/mcp.M600162-MCP200. Epub 2006 Aug 3. Mol Cell Proteomics. 2006. PMID: 16887931 Free PMC article. Review.
Cited by
-
MD-Ligand-Receptor: A High-Performance Computing Tool for Characterizing Ligand-Receptor Binding Interactions in Molecular Dynamics Trajectories.Int J Mol Sci. 2023 Jul 19;24(14):11671. doi: 10.3390/ijms241411671. Int J Mol Sci. 2023. PMID: 37511429 Free PMC article.
-
A simple, time- and cost-effective, high-throughput depletion strategy for deep plasma proteomics.Sci Adv. 2023 Mar 29;9(13):eadf9717. doi: 10.1126/sciadv.adf9717. Epub 2023 Mar 29. Sci Adv. 2023. PMID: 36989362 Free PMC article.
-
Longitudinal plasma proteomic analysis of 1117 hospitalized patients with COVID-19 identifies features associated with severity and outcomes.Sci Adv. 2024 May 24;10(21):eadl5762. doi: 10.1126/sciadv.adl5762. Epub 2024 May 24. Sci Adv. 2024. PMID: 38787940 Free PMC article.
-
Multi-omic longitudinal study reveals immune correlates of clinical course among hospitalized COVID-19 patients.Cell Rep Med. 2023 Jun 20;4(6):101079. doi: 10.1016/j.xcrm.2023.101079. Epub 2023 May 23. Cell Rep Med. 2023. PMID: 37327781 Free PMC article.
-
Analytical challenges in omics research on asthma and allergy: A National Institute of Allergy and Infectious Diseases workshop.J Allergy Clin Immunol. 2024 Apr;153(4):954-968. doi: 10.1016/j.jaci.2024.01.014. Epub 2024 Jan 29. J Allergy Clin Immunol. 2024. PMID: 38295882 Free PMC article.
References
-
- Cox J & Mann M MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26, 1367–1372 (2008). - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- U19 AI090023/AI/NIAID NIH HHS/United States
- U19 AI128913/AI/NIAID NIH HHS/United States
- U19 AI118608/AI/NIAID NIH HHS/United States
- U54 AI142766/AI/NIAID NIH HHS/United States
- U19 AI057229/AI/NIAID NIH HHS/United States
- U19 AI062629/AI/NIAID NIH HHS/United States
- U19 AI077439/AI/NIAID NIH HHS/United States
- U19 AI118610/AI/NIAID NIH HHS/United States
- U19 AI128910/AI/NIAID NIH HHS/United States
- R01 AI104870/AI/NIAID NIH HHS/United States
- U19 AI125357/AI/NIAID NIH HHS/United States
- R01 AI145835/AI/NIAID NIH HHS/United States
- R01 AI135803/AI/NIAID NIH HHS/United States
- U19 AI089992/AI/NIAID NIH HHS/United States
LinkOut - more resources
Full Text Sources
Medical