Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun 27;6(6):752-758.e1.
doi: 10.1016/j.cels.2018.05.012.

Community-Driven Data Analysis Training for Biology

Affiliations

Community-Driven Data Analysis Training for Biology

Bérénice Batut et al. Cell Syst. .

Abstract

The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a community-driven framework that enables modern, interactive teaching of data analytics in life sciences and facilitates the development of training materials. The key feature of our system is that it is not a static but a continuously improved collection of tutorials. By coupling tutorials with a web-based analysis framework, biomedical researchers can learn by performing computation themselves through a web browser without the need to install software or search for example datasets. Our ultimate goal is to expand the breadth of training materials to include fundamental statistical and data science topics and to precipitate a complete re-engineering of undergraduate and graduate curricula in life sciences. This project is accessible at https://training.galaxyproject.org.

Keywords: data analysis; genomics; next-generation sequencing; proteomics; training.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Key elements of an interactive tutorial.
A. A list of tutorials dedicated to Metagenomics. There is a set of introductory slides and two hands-on tutorials. B. A fragment of introductory material within a tutorial. C. A “hands-on” element with upper box contains instructions for running a tool inside Galaxy and shown example output of Krona tool. The question box at the bottom contains togglable answer field. D. Summary of key points for this tutorial displayed on the bottom of the tutorial. E. Fragment of Galaxy interface showing interactive tour balloon.
Figure 2.
Figure 2.. Structure and development of content in GitHub
(http://github.com/galaxyproject/training-material). The material is organized in different topics, each topic in a dedicated directory. Inside each topic’s directory, the structure is the same: a metadata file, a directory with the topic introduction slide decks, a directory with the tutorials and a directory with the Dockerfile describing the details to build a container for the topic that would contain a dedicated Galaxy instance with all tools relevant for the tutorials. Inside the topic directory, each tutorial related to the topic has its own subdirectory with several files: a tutorial file written in Markdown with hands-on, an optional slides file to support the tutorial, a directory with Galaxy Interactive Tours to reproduce the tutorial, a directory with workflows extracted from the tutorial, a file with the links to the input data needed for the tutorial and a file with the description of needed tools to run the tutorial. The process of development of new content is shown at the bottom of the figure.
Figure 3.
Figure 3.. History of training activities.
Number (A) and location (B) of registered training events organized by the Galaxy Training Network since 2015. C. Number of tutorial contributors per month.

References

    1. Larcombe L, Hendricusdottir R, Attwood TK, Bacall F, Beard N, Bellis LJ, et al. ELIXIR-UK role in bioinformatics training at the national level and across ELIXIR. F1000Res. 2017;6. doi:10.12688/f1000research.11837.1 - DOI - PMC - PubMed
    1. Williams JJ, Teal TK. A vision for collaborative training infrastructure for bioinformatics. Ann N Y Acad Sci. 2017;1387: 54–60. - PubMed
    1. Attwood TK, Blackford S, Brazas MD, Davies A, Schneider MV. A global perspective on evolving bioinformatics and data science training needs. Brief Bioinform. 2017; doi:10.1093/bib/bbx100 - DOI - PMC - PubMed
    1. Community Survey Report - 2013 - EMBL-ABR. In: EMBL-ABR [Internet]. [cited 17 October 2017]. Available: https://www.embl-abr.org.au/news/braembl-community-survey-report-2013/
    1. Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res. 2016;44: W3–W10. - PMC - PubMed

Publication types

LinkOut - more resources