Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr;18(2):104-119.
doi: 10.1016/j.gpb.2019.11.008. Epub 2020 Aug 12.

DPHL: A DIA Pan-human Protein Mass Spectrometry Library for Robust Biomarker Discovery

Tiansheng Zhu  1 Yi Zhu  2 Yue Xuan  3 Huanhuan Gao  4 Xue Cai  4 Sander R Piersma  5 Thang V Pham  5 Tim Schelfhorst  5 Richard R G D Haas  5 Irene V Bijnsdorp  6 Rui Sun  4 Liang Yue  4 Guan Ruan  4 Qiushi Zhang  4 Mo Hu  7 Yue Zhou  7 Winan J Van Houdt  8 Tessa Y S Le Large  9 Jacqueline Cloos  10 Anna Wojtuszkiewicz  10 Danijela Koppers-Lalic  11 Franziska Böttger  12 Chantal Scheepbouwer  13 Ruud H Brakenhoff  14 Geert J L H van Leenders  15 Jan N M Ijzermans  16 John W M Martens  17 Renske D M Steenbergen  18 Nicole C Grieken  18 Sathiyamoorthy Selvarajan  19 Sangeeta Mantoo  19 Sze S Lee  20 Serene J Y Yeow  20 Syed M F Alkaff  19 Nan Xiang  4 Yaoting Sun  4 Xiao Yi  4 Shaozheng Dai  21 Wei Liu  4 Tian Lu  4 Zhicheng Wu  1 Xiao Liang  4 Man Wang  22 Yingkuan Shao  23 Xi Zheng  23 Kailun Xu  23 Qin Yang  24 Yifan Meng  24 Cong Lu  25 Jiang Zhu  25 Jin'e Zheng  25 Bo Wang  26 Sai Lou  27 Yibei Dai  28 Chao Xu  29 Chenhuan Yu  30 Huazhong Ying  30 Tony K Lim  19 Jianmin Wu  22 Xiaofei Gao  31 Zhongzhi Luan  21 Xiaodong Teng  26 Peng Wu  24 Shi'ang Huang  25 Zhihua Tao  28 Narayanan G Iyer  20 Shuigeng Zhou  32 Wenguang Shao  33 Henry Lam  34 Ding Ma  24 Jiafu Ji  22 Oi L Kon  20 Shu Zheng  23 Ruedi Aebersold  35 Connie R Jimenez  5 Tiannan Guo  36
Affiliations

DPHL: A DIA Pan-human Protein Mass Spectrometry Library for Robust Biomarker Discovery

Tiansheng Zhu et al. Genomics Proteomics Bioinformatics. 2020 Apr.

Abstract

To address the increasing need for detecting and validating protein biomarkers in clinical specimens, mass spectrometry (MS)-based targeted proteomic techniques, including the selected reaction monitoring (SRM), parallel reaction monitoring (PRM), and massively parallel data-independent acquisition (DIA), have been developed. For optimal performance, they require the fragment ion spectra of targeted peptides as prior knowledge. In this report, we describe a MS pipeline and spectral resource to support targeted proteomics studies for human tissue samples. To build the spectral resource, we integrated common open-source MS computational tools to assemble a freely accessible computational workflow based on Docker. We then applied the workflow to generate DPHL, a comprehensive DIA pan-human library, from 1096 data-dependent acquisition (DDA) MS raw files for 16 types of cancer samples. This extensive spectral resource was then applied to a proteomic study of 17 prostate cancer (PCa) patients. Thereafter, PRM validation was applied to a larger study of 57 PCa patients and the differential expression of three proteins in prostate tumor was validated. As a second application, the DPHL spectral resource was applied to a study consisting of plasma samples from 19 diffuse large B cell lymphoma (DLBCL) patients and 18 healthy control subjects. Differentially expressed proteins between DLBCL patients and healthy control subjects were detected by DIA-MS and confirmed by PRM. These data demonstrate that the DPHL supports DIA and PRM MS pipelines for robust protein biomarker discovery. DPHL is freely accessible at https://www.iprox.org/page/project.html?id=IPX0001400000.

Keywords: Data-independent acquisition; Diffuse large B cell lymphoma; Parallel reaction monitoring; Prostate cancer; Spectral library.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The research group of Tiannan Guo is partly supported by Thermo Fisher Scientific and Pressure Biosciences, which provided access to advanced sample preparation instrumentation. Yue Xuan, Mo Hu, and Yue Zhou were employees of Thermo Fisher Scientific during this project. The remaining authors declare no competing interests.

Figures

Figure 1
Figure 1
Workflow for building DPHL A. Schematic representation of DDA shotgun proteomics data acquisition. Numbers in parentheses indicate the number of DDA files per tissue type. B. Protein identification and iRT detection from DDA raw files using pFind. C. SiRT detection and calibration. D. CiRT detection and calibration. E. Generation of DPHL. Details of the commands are presented in File S18. DDA, data-dependent acquisition; DIA, data-independent acquisition; iRT, indexed retention time; PCT, pressure cycling technology; SCX, strong cation-exchange; SiRT, synthetic iRT; CiRT, common internal iRT; DPHL, DIA pan-human library.
Figure 2
Figure 2
Comparison of DPHL and PHL A. Venn diagram showing the comparison of transition groups (i.e., peptide precursors), peptides, protein groups, and proteins in DPHL and PHL. B. Visualization of tissue intersections using R package UpSet. C. Bar plots displaying the number of transition groups, peptides, protein groups, proteins in DPHL library for each sample type. PHL, pan-human spectral library.
Figure 3
Figure 3
PCa proteome using 60-min gradient DIA A. Number of protein groups and peptide precursors identified using SiRT and CiRT. B. Technical reproducibility of proteome matrix using CiRT and SiRT. C. Comparison of protein quantification based on MS intensity using the SiRT and CiRT methods. D. 2D plane t-SNE plot of disease classes, color coded by sample type using CiRT and SiRT. E. Boxplots showing the expression (MS intensity) of the significantly dysregulated proteins; P values adjusted with Benjamini & Hochberg are shown under each protein name. ROC curves of the proteins were also shown. R1, technical replicate 1; R2, technical replicate 2; PCa, prostate cancer; BPH, benign prostate hyperplasia; t-SNE, t-distributed stochastic neighbor embedding; FASN, fatty acid synthetase, UniProtKB: P49327; TPP1, tripeptidyl-peptidase 1, UniProtKB: O14773; SPON2, spondin-2, UniProtKB: Q9BUD6.
Figure 4
Figure 4
DIA analysis of plasma samples from DLBCL patients and HC subjects A. Technical reproducibility for protein quantification of four plasma samples from two DLBCL patients and two healthy control subjects. B. 2D plane t-SNE plot showing that proteomes are separated. C. Volcano plot showing significantly down-regulated (blue) and up-regulated (red) proteins in 37 plasma samples (19 samples from DLBCL patients and 18 samples from HC subjects). D. The relative protein expression as calculated by MS intensity for CRP and SAA1. P values adjusted with Benjamini & Hochberg are shown under each protein name Left: Boxplot and ROC curve of CRP. Right: Boxplot and ROC curve of SAA1. DLBCL, diffuse large B cell lymphoma; HC, healthy control; CRP, C-reactive protein, UniProtKB: P02741; SAA1, serum amyloid A1, UniProtKB: P0DJI8.
Figure 5
Figure 5
PRM validation of TPP1, FASN, and SPON2 across 73 peptide samples from 53 PCa patients Two best flying peptides were selected for each protein. For each peptide, boxplot shows the relative abundance of the peptide across 73 PRM runs as calculated from MS intensity (on the left), and XIC demonstrates a representative peak group of the peptide (on the right). P values are computed using Student’s t test. PRM, parallel reaction monitoring; XIC, extracted ion chromatogram.

References

    1. Schubert O.T., Gillet L.C., Collins B.C., Navarro P., Rosenberger G., Wolski W.E. Building high-quality assay libraries for targeted analysis of SWATH MS data. Nat Protoc. 2015;10:426–441. - PubMed
    1. Sandhu C., Qureshi A., Emili A. Panomics for precision medicine. Trends Mol Med. 2018;24:85–101. - PMC - PubMed
    1. Aronson S.J., Rehm H.L. Building the foundation for genomics in precision medicine. Nature. 2015;526:336–342. - PMC - PubMed
    1. Yang J.Y.C., Sarwal M.M. Transplant genetics and genomics. Nat Rev Genet. 2017;18:309–326. - PubMed
    1. Zhang B., Wang J., Wang X., Zhu J., Liu Q., Shi Z. Proteogenomic characterization of human colon and rectal cancer. Nature. 2014;513:382–387. - PMC - PubMed

Publication types