Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan 1:2020:baz146.
doi: 10.1093/database/baz146.

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing

Affiliations

KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing

Kwang Su Jung et al. Database (Oxford). .

Erratum in

Abstract

Since 2012, the Center for Genome Science of the Korea National Institute of Health (KNIH) has been sequencing complete genomes of 1722 Korean individuals. As a result, more than 32 million variant sites have been identified, and a large proportion of the variant sites have been detected for the first time. In this article, we describe the Korean Reference Genome Database (KRGDB) and its genome browser. The current version of our database contains both single nucleotide and short insertion/deletion variants. The DNA samples were obtained from four different origins and sequenced in different sequencing depths (10× coverage of 63 individuals, 20× coverage of 194 individuals, combined 10× and 20× coverage of 135 individuals, 30× coverage of 230 individuals and 30× coverage of 1100 individuals). The major features of the KRGDB are that it contains information on the Korean genomic variant frequency, frequency difference between the Korean and other populations and the variant functional annotation (such as regulatory elements in ENCODE regions and coding variant functions) of the variant sites. Additionally, we performed the genome-wide association study (GWAS) between Korean genome variant sites for the 30×230 individuals and three major common diseases (diabetes, hypertension and metabolic syndrome). The association results are displayed on our browser. The KRGDB uses the MySQL database and Apache-Tomcat web server adopted with Java Server Page (JSP) and is freely available at http://coda.nih.go.kr/coda/KRGDB/index.jsp. Availability: http://coda.nih.go.kr/coda/KRGDB/index.jsp.

PubMed Disclaimer

Figures

Figure 1
Figure 1
System architecture of KRGDB and Genome Browser. The system mainly consists of variation/annotation database and its genome browser.
Figure 2
Figure 2
Alternative allele frequency difference between KRG and HapMap III ethnics. The horizontal axis denotes the genomic positions of the chosen chromosome (chr1).
Figure 3
Figure 3
Alternative allele frequency difference between KRG and 1000 Genomes ethnics. The horizontal axis denotes the genomic positions of the chosen chromosome (chr1).
Figure 4
Figure 4
Disease risks of type II diabetes (DM), hypertension (HTN) and metabolic syndrome (MS). Each dot represents risk P values (–logP). The red and blue colour indicates odds ratio ≥1.0 and <1.0, respectively. The horizontal axis denotes the genomic positions of the chosen chromosome (chr1).

References

    1. Shendure J. and Ji H. (2008) Next-generation DNA sequencing. Nat. Biotechnol., 26, 1135–1145. - PubMed
    1. 1000 Genomes Project Consortium (2012) An integrated map of genetic variation from 1092 human genomes. Nature, 491, 56–65. - PMC - PubMed
    1. Wheeler D.A., Srinivasan M., Egholm M. et al. (2008) The complete genome of an individual by massively parallel DNA sequencing. Nature, 452, 872–876. - PubMed
    1. Pushkarev D., Neff N.F. and Quake S.R. (2009) Single-molecule sequencing of an individual human genome. Nat. Biotechnol., 27, 847–850. - PMC - PubMed
    1. Bentley D.R., Balasubramanian S., Swerdlow H.P. et al. (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature, 456, 53–59. - PMC - PubMed

Publication types