Review
Brief Bioinform. 2021 Mar 22;22(2):664-675.
doi: 10.1093/bib/bbaa359.

A review on viral data sources and search systems for perspective mitigation of COVID-19

Anna Bernasconi et al. Brief Bioinform.

Abstract

With the outbreak of COVID-19, the research community is making unprecedented efforts to better understand and mitigate the effects of the pandemic. In this context, we review the data integration efforts required for accessing and searching genome sequences and metadata of SARS-CoV2, the virus responsible for COVID-19, which have been deposited into the most important repositories of viral sequences. Organizations that were already active in the virus domain are now dedicating special attention to the COVID-19 pandemic by emphasizing specific SARS-CoV2 data and services. At the same time, novel organizations and resources were born in this critical period to serve specifically the purposes of COVID-19 mitigation while laying the research groundwork for countering possible future pandemics. Accessibility and integration of viral sequence data, possibly in conjunction with human host genotype and clinical data, are paramount to better understand COVID-19 and mitigate its effects. Few examples of host-pathogen integrated datasets exist so far, but we expect them to grow together with knowledge of the disease; once such datasets are available, useful integrative surveillance mechanisms can be put in place by observing how common variants distribute in time and space and relating them to the phenotypic impact evidenced in the literature.

Keywords: COVID-19; data harmonization; epidemic; genomics; integration and search; metadata; viral sequences.

Figures

Figure 1
Entity-relationship diagram of the Phenotype Data Dictionary proposed within the COVID-19 Host Genetics Initiative.
