Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan;17(1):41-44.
doi: 10.1038/s41592-019-0638-x. Epub 2019 Nov 25.

DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput

Affiliations

DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput

Vadim Demichev et al. Nat Methods. 2020 Jan.

Abstract

We present an easy-to-use integrated software suite, DIA-NN, that exploits deep neural networks and new quantification and signal correction strategies for the processing of data-independent acquisition (DIA) proteomics experiments. DIA-NN improves the identification and quantification performance in conventional DIA proteomic applications, and is particularly beneficial for high-throughput applications, as it is fast and enables deep and confident proteome coverage when used in combination with fast chromatographic methods.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. DIA-NN workflow and its performance on conventional and short chromatographic gradients.
a, Schematic: DIA-NN workflow. Chromatograms are extracted for each precursor ion and all its fragment ions (the chromatograms are shown schematically, with different colours corresponding to different fragments). Putative elution peaks are then scored, and the ‘best’ peak (marked with a star) is selected. Potentially interfering peptides are then detected and removed. The precursor-peak matches obtained allow to calculate q-values using an ensemble of deep neural networks as well as remove interferences from the fragment elution curves. b, Identification performance of DIA-NN when processing technical repeat injections of a HeLa tryptic digest analysis (QExactive HF, 0.5h - 4h gradient lengths). Precursor identification numbers are plotted against the FDR, estimated using a two-species compound human-maize spectral library method (Methods). Each point on the graph corresponds to a decoy (maize) precursor, its x-axis value reflecting the estimated FDR at the respective score threshold and its y-axis value being the number of identified target (human) precursors at this threshold. The 0.5h acquisition was not analysed with OpenSWATH for technical reasons. c, Log2-quantities of precursors reported for both the 0.5h acquisition – among top 50000 by Spectronaut (top panel) or DIA-NN (middle panel), and the 4h acquisition (among top 100000 by Spectronaut). R2 values were calculated using linear regression with unity slope. Precursors identified exclusively by either Spectronaut (8379 total) or DIA-NN (8511 total) at 0.5h (i.e. those precursors, identifications of which are not supported by the other tool at the same gradient) are highlighted in yellow. For these, the distribution densities of the differences (centered) between the 0.5h log2-quantities reported by Spectronaut or DIA-NN and 4h log2-quantities reported by Spectronaut (bottom panel) were plotted.
Fig. 2
Fig. 2. LFQbench test performance of DIA-NN.
Quantification precision was benchmarked using two peptide preparations (yeast and E.coli) that were spiked in two different proportions (A and B, three repeat injections each) into a human peptide preparation. The data were processed at 1% q-value (reported by the software tools themselves, i.e. the effective FDR for DIA-NN and Spectronaut may be different) using a spectral library generated from a fractionated sample analysis with DDA. Peptide ratios between the mixtures were visualised using the LFQbench R package (left panel; the dashed lines indicate the expected ratios). Right panel: peptide and protein quantification performance given as box-plots (boxes: interquartile range, whiskers: 1-99 percentile; n = 15442 and 15743 (human), 3403 and 3755 (yeast), 4494 and 4997 (E.coli) for peptide ratios obtained from the reports of Spectronaut and DIA-NN, respectively; n = 1921 and 1950 (human), 529 and 550 (yeast), 566 and 616 (E.coli) for protein ratios obtained from the reports of Spectronaut and DIA-NN, respectively).

References

    1. Yates JR, Ruse CI, Nakorchevsky A. Annu Rev Biomed Eng. 2009;11:49–79. - PubMed
    1. Aebersold R, Mann M. Nature. 2016 Sep;537:347–355. - PubMed
    1. Geyer PE, Holdt LM, Teupser D, Mann M. Mol Syst Biol. 2017 Sep;13:942. - PMC - PubMed
    1. Zelezniak A, et al. Cell Syst. 2018;7:269–283.e6. - PMC - PubMed
    1. Bruderer R, et al. Mol Cell Proteomics. 2015 May;14:1400–1410. - PMC - PubMed

Publication types