COVID-19 open source data sets: a comprehensive survey
- PMID: 34764552
- PMCID: PMC7503433
- DOI: 10.1007/s10489-020-01862-6
COVID-19 open source data sets: a comprehensive survey
Abstract
In December 2019, a novel virus named COVID-19 emerged in the city of Wuhan, China. In early 2020, the COVID-19 virus spread in all continents of the world except Antarctica, causing widespread infections and deaths due to its contagious characteristics and no medically proven treatment. The COVID-19 pandemic has been termed as the most consequential global crisis since the World Wars. The first line of defense against the COVID-19 spread are the non-pharmaceutical measures like social distancing and personal hygiene. The great pandemic affecting billions of lives economically and socially has motivated the scientific community to come up with solutions based on computer-aided digital technologies for diagnosis, prevention, and estimation of COVID-19. Some of these efforts focus on statistical and Artificial Intelligence-based analysis of the available data concerning COVID-19. All of these scientific efforts necessitate that the data brought to service for the analysis should be open source to promote the extension, validation, and collaboration of the work in the fight against the global pandemic. Our survey is motivated by the open source efforts that can be mainly categorized as (a) COVID-19 diagnosis from CT scans, X-ray images, and cough sounds, (b) COVID-19 case reporting, transmission estimation, and prognosis from epidemiological, demographic, and mobility data, (c) COVID-19 emotional and sentiment analysis from social media, and (d) knowledge-based discovery and semantic analysis from the collection of scholarly articles covering COVID-19. We survey and compare research works in these directions that are accompanied by open source data and code. Future research directions for data-driven COVID-19 research are also debated. We hope that the article will provide the scientific community with an initiative to start open source extensible and transparent research in the collective fight against the COVID-19 pandemic.
Keywords: Artificial intelligence; COVID-19; Coronavirus; Data sets; Machine learning; Open source; Pandemic.
© Springer Science+Business Media, LLC, part of Springer Nature 2020.
Figures
Similar articles
-
Collaborating in the Time of COVID-19: The Scope and Scale of Innovative Responses to a Global Pandemic.JMIR Public Health Surveill. 2021 Feb 9;7(2):e25935. doi: 10.2196/25935. JMIR Public Health Surveill. 2021. PMID: 33503001 Free PMC article. Review.
-
Adoption of Digital Technologies in Health Care During the COVID-19 Pandemic: Systematic Review of Early Scientific Literature.J Med Internet Res. 2020 Nov 6;22(11):e22280. doi: 10.2196/22280. J Med Internet Res. 2020. PMID: 33079693 Free PMC article.
-
Comprehensive Survey of Using Machine Learning in the COVID-19 Pandemic.Diagnostics (Basel). 2021 Jun 24;11(7):1155. doi: 10.3390/diagnostics11071155. Diagnostics (Basel). 2021. PMID: 34202587 Free PMC article. Review.
-
Lessons Learned from the COVID-19 Pandemic: Emphasizing the Emerging Role and Perspectives from Artificial Intelligence, Mobile Health, and Digital Laboratory Medicine.EJIFCC. 2021 Jun 29;32(2):224-243. eCollection 2021 Jun. EJIFCC. 2021. PMID: 34421492 Free PMC article. Review.
-
Application of Artificial Intelligence in COVID-19 Diagnosis and Therapeutics.J Pers Med. 2021 Sep 4;11(9):886. doi: 10.3390/jpm11090886. J Pers Med. 2021. PMID: 34575663 Free PMC article. Review.
Cited by
-
Medical imaging and computational image analysis in COVID-19 diagnosis: A review.Comput Biol Med. 2021 Aug;135:104605. doi: 10.1016/j.compbiomed.2021.104605. Epub 2021 Jun 23. Comput Biol Med. 2021. PMID: 34175533 Free PMC article. Review.
-
Bibliometric analysis of the use of artificial intelligence in COVID-19 based on scientific studies.Health Sci Rep. 2023 May 4;6(5):e1244. doi: 10.1002/hsr2.1244. eCollection 2023 May. Health Sci Rep. 2023. PMID: 37152228 Free PMC article.
-
Challenges issues and future recommendations of deep learning techniques for SARS-CoV-2 detection utilising X-ray and CT images: a comprehensive review.PeerJ Comput Sci. 2024 Dec 24;10:e2517. doi: 10.7717/peerj-cs.2517. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39896401 Free PMC article.
-
Characteristics of Imperial College London's COVID-19 research outputs.Learn Publ. 2021 Jul;34(3):358-369. doi: 10.1002/leap.1358. Epub 2021 Jan 12. Learn Publ. 2021. PMID: 33821101 Free PMC article.
-
Machine Learning-Based Prediction of Growth in Confirmed COVID-19 Infection Cases in 114 Countries Using Metrics of Nonpharmaceutical Interventions and Cultural Dimensions: Model Development and Validation.J Med Internet Res. 2021 Apr 23;23(4):e26628. doi: 10.2196/26628. J Med Internet Res. 2021. PMID: 33844636 Free PMC article.
References
-
- World Health Organization (2020) Coronavirus disease 2019 (covid-19): situation report 162
-
- Lopez CE, Vasu M, Gallemore C (2020) Understanding the perception of covid-19 policies by mining a multilanguage twitter dataset. arXiv:2003.10359
LinkOut - more resources
Full Text Sources
Other Literature Sources