Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Oct 19;13(10):e1005755.
doi: 10.1371/journal.pcbi.1005755. eCollection 2017 Oct.

Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators

Affiliations

Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators

Lindsay Barone et al. PLoS Comput Biol. .

Erratum in

Abstract

In a 2016 survey of 704 National Science Foundation (NSF) Biological Sciences Directorate principal investigators (BIO PIs), nearly 90% indicated they are currently or will soon be analyzing large data sets. BIO PIs considered a range of computational needs important to their work, including high performance computing (HPC), bioinformatics support, multistep workflows, updated analysis software, and the ability to store, share, and publish data. Previous studies in the United States and Canada emphasized infrastructure needs. However, BIO PIs said the most pressing unmet needs are training in data integration, data management, and scaling analyses for HPC-acknowledging that data science skills will be required to build a deeper understanding of life. This portends a growing data knowledge gap in biology and challenges institutions and funding agencies to redouble their support for computational training in biology.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Major data types used by National Science Foundation (NSF) Biological Sciences Directorate (BIO) principal investigators (PIs).
Fig 2
Fig 2. Current (grey) and future (blue) data analysis needs of National Science Foundation (NSF) Biological Sciences Directorate (BIO) principal investigators (PIs) (percent responding affirmatively, 387 ≤ n ≤ 551).
Fig 3
Fig 3. Unmet data analysis needs of National Science Foundation (NSF) Biological Sciences Directorate (BIO) principal investigators (PIs) (percent responding negatively, 318 ≤ n ≤ 510).

References

    1. GenBank and WGS Statistics. National Center for Biotechnology Information. 2017. Available from: https://www.ncbi.nlm.nih.gov/genbank/statistics/
    1. HiSeq X Series of Sequencing Systems. Illumina. 2017. Available from: https://www.illumina.com/content/dam/illumina-marketing/documents/produc...
    1. Wetterstrand K. DNA Sequencing Costs: Data. National Human Genome Research Institute (NHGRI). 2016. Available from: https://www.genome.gov/sequencingcostsdata/
    1. Sequence Read Archive. National Center for Biotechnology Information. 2017. Available from: https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=announcement
    1. Stephens Z, Lee S, Faghri F, Campbell R, Zhai C, Efron M et al. Big Data: Astronomical or genomical? PLoS Biol. 2015; 13(7): e1002195 doi: 10.1371/journal.pbio.1002195 - DOI - PMC - PubMed

MeSH terms