A Parallelization Strategy for the Time Efficient Analysis of Thousands of LC/MS Runs in High-Performance Computing Environment
- PMID: 36201825
- PMCID: PMC9930095
- DOI: 10.1021/acs.jproteome.2c00278
A Parallelization Strategy for the Time Efficient Analysis of Thousands of LC/MS Runs in High-Performance Computing Environment
Abstract
Combining robust proteomics instrumentation with high-throughput enabling liquid chromatography (LC) systems (e.g., timsTOF Pro and the Evosep One system, respectively) enabled mapping the proteomes of 1000s of samples. Fragpipe is one of the few computational protein identification and quantification frameworks that allows for the time-efficient analysis of such large data sets. However, it requires large amounts of computational power and data storage space that leave even state-of-the-art workstations underpowered when it comes to the analysis of proteomics data sets with 1000s of LC mass spectrometry runs. To address this issue, we developed and optimized a Fragpipe-based analysis strategy for a high-performance computing environment and analyzed 3348 plasma samples (6.4 TB) that were longitudinally collected from hospitalized COVID-19 patients under the auspice of the Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC) study. Our parallelization strategy reduced the total runtime by ∼90% from 116 (theoretical) days to just 9 days in the high-performance computing environment. All code is open-source and can be deployed in any Simple Linux Utility for Resource Management (SLURM) high-performance computing environment, enabling the analysis of large-scale high-throughput proteomics studies.
Keywords: Fragpipe; HPC; SLURM; parallelization; proteomics; timsTOF.
Conflict of interest statement
Conflicts of interest
N/A
Figures
References
-
- Cox J & Mann M MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26, 1367–1372 (2008). - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- U19 AI090023/AI/NIAID NIH HHS/United States
- U19 AI128913/AI/NIAID NIH HHS/United States
- U19 AI118608/AI/NIAID NIH HHS/United States
- U54 AI142766/AI/NIAID NIH HHS/United States
- U19 AI057229/AI/NIAID NIH HHS/United States
- U19 AI062629/AI/NIAID NIH HHS/United States
- U19 AI077439/AI/NIAID NIH HHS/United States
- U19 AI118610/AI/NIAID NIH HHS/United States
- U19 AI128910/AI/NIAID NIH HHS/United States
- R01 AI104870/AI/NIAID NIH HHS/United States
- U19 AI125357/AI/NIAID NIH HHS/United States
- R01 AI145835/AI/NIAID NIH HHS/United States
- R01 AI135803/AI/NIAID NIH HHS/United States
- U19 AI089992/AI/NIAID NIH HHS/United States
LinkOut - more resources
Full Text Sources
Medical
