Development of Bioinformatics Infrastructure for Genomics Research
- PMID: 28302555
- PMCID: PMC5582980
- DOI: 10.1016/j.gheart.2017.01.005
Development of Bioinformatics Infrastructure for Genomics Research
Abstract
Background: Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet's role has evolved in response to changing needs from the consortium and the African bioinformatics community.
Objectives: H3ABioNet set out to develop core bioinformatics infrastructure and capacity for genomics research in various aspects of data collection, transfer, storage, and analysis.
Methods and results: Various resources have been developed to address genomic data management and analysis needs of H3Africa researchers and other scientific communities on the continent. NetMap was developed and used to build an accurate picture of network performance within Africa and between Africa and the rest of the world, and Globus Online has been rolled out to facilitate data transfer. A participant recruitment database was developed to monitor participant enrollment, and data is being harmonized through the use of ontologies and controlled vocabularies. The standardized metadata will be integrated to provide a search facility for H3Africa data and biospecimens. Because H3Africa projects are generating large-scale genomic data, facilities for analysis and interpretation are critical. H3ABioNet is implementing several data analysis platforms that provide a large range of bioinformatics tools or workflows, such as Galaxy, the Job Management System, and eBiokits. A set of reproducible, portable, and cloud-scalable pipelines to support the multiple H3Africa data types are also being developed and dockerized to enable execution on multiple computing infrastructures. In addition, new tools have been developed for analysis of the uniquely divergent African data and for downstream interpretation of prioritized variants. To provide support for these and other bioinformatics queries, an online bioinformatics helpdesk backed by broad consortium expertise has been established. Further support is provided by means of various modes of bioinformatics training.
Conclusions: For the past 4 years, the development of infrastructure support and human capacity through H3ABioNet, have significantly contributed to the establishment of African scientific networks, data analysis facilities, and training programs. Here, we describe the infrastructure and how it has affected genomics and bioinformatics research in Africa.
Copyright © 2017 World Heart Federation (Geneva). Published by Elsevier B.V. All rights reserved.
Figures
Similar articles
-
Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics.BMC Bioinformatics. 2018 Nov 29;19(1):457. doi: 10.1186/s12859-018-2446-1. BMC Bioinformatics. 2018. PMID: 30486782 Free PMC article.
-
Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience.AAS Open Res. 2019 Aug 7;1:9. doi: 10.12688/aasopenres.12847.2. eCollection 2018. AAS Open Res. 2019. PMID: 32382696 Free PMC article.
-
H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa.Genome Res. 2016 Feb;26(2):271-7. doi: 10.1101/gr.196295.115. Epub 2015 Dec 1. Genome Res. 2016. PMID: 26627985 Free PMC article.
-
H3Africa and the African life sciences ecosystem: building sustainable innovation.OMICS. 2014 Dec;18(12):733-9. doi: 10.1089/omi.2014.0145. OMICS. 2014. PMID: 25454511 Free PMC article. Review.
-
H3Africa: current perspectives.Pharmgenomics Pers Med. 2018 Apr 10;11:59-66. doi: 10.2147/PGPM.S141546. eCollection 2018. Pharmgenomics Pers Med. 2018. PMID: 29692621 Free PMC article. Review.
Cited by
-
One Step Ahead in Realizing Pharmacogenetics in Low- and Middle-Income Countries: What Should We Do?J Multidiscip Healthc. 2024 Oct 23;17:4863-4874. doi: 10.2147/JMDH.S458564. eCollection 2024. J Multidiscip Healthc. 2024. PMID: 39464786 Free PMC article. Review.
-
Integration of 168,000 samples reveals global patterns of the human gut microbiome.bioRxiv [Preprint]. 2023 Oct 11:2023.10.11.560955. doi: 10.1101/2023.10.11.560955. bioRxiv. 2023. Update in: Cell. 2025 Feb 20;188(4):1100-1118.e17. doi: 10.1016/j.cell.2024.12.017. PMID: 37873416 Free PMC article. Updated. Preprint.
-
Cancer genomics and bioinformatics in Latin American countries: applications, challenges, and perspectives.Front Oncol. 2025 Jul 9;15:1584178. doi: 10.3389/fonc.2025.1584178. eCollection 2025. Front Oncol. 2025. PMID: 40703551 Free PMC article. Review.
-
Public human microbiome data are dominated by highly developed countries.PLoS Biol. 2022 Feb 15;20(2):e3001536. doi: 10.1371/journal.pbio.3001536. eCollection 2022 Feb. PLoS Biol. 2022. PMID: 35167588 Free PMC article.
-
The Sickle Cell Disease Ontology: Enabling Collaborative Research and Co-Designing of New Planetary Health Applications.OMICS. 2020 Oct;24(10):559-567. doi: 10.1089/omi.2020.0153. OMICS. 2020. PMID: 33021900 Free PMC article.
References
-
- Foster I. Globus Online: Accelerating and Democratizing Science through Cloud-Based Services. Internet Computing, IEEE. 2011;15(3):70–73.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous