The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group
- PMID: 19470904
- PMCID: PMC2752128
- DOI: 10.1101/gr.092197.109
The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group
Abstract
We present the first Korean individual genome sequence (SJK) and analysis results. The diploid genome of a Korean male was sequenced to 28.95-fold redundancy using the Illumina paired-end sequencing method. SJK covered 99.9% of the NCBI human reference genome. We identified 420,083 novel single nucleotide polymorphisms (SNPs) that are not in the dbSNP database. Despite a close similarity, significant differences were observed between the Chinese genome (YH), the only other Asian genome available, and SJK: (1) 39.87% (1,371,239 out of 3,439,107) SNPs were SJK-specific (49.51% against Venter's, 46.94% against Watson's, and 44.17% against the Yoruba genomes); (2) 99.5% (22,495 out of 22,605) of short indels (< 4 bp) discovered on the same loci had the same size and type as YH; and (3) 11.3% (331 out of 2920) deletion structural variants were SJK-specific. Even after attempting to map unmapped reads of SJK to unanchored NCBI scaffolds, HGSV, and available personal genomes, there were still 5.77% SJK reads that could not be mapped. All these findings indicate that the overall genetic differences among individuals from closely related ethnic groups may be significant. Hence, constructing reference genomes for minor socio-ethnic groups will be useful for massive individual genome sequencing.
Figures




Similar articles
-
A highly annotated whole-genome sequence of a Korean individual.Nature. 2009 Aug 20;460(7258):1011-5. doi: 10.1038/nature08211. Epub 2009 Jul 8. Nature. 2009. PMID: 19587683 Free PMC article.
-
The diploid genome sequence of an Asian individual.Nature. 2008 Nov 6;456(7218):60-5. doi: 10.1038/nature07484. Nature. 2008. PMID: 18987735 Free PMC article.
-
Whole genome analysis of a Vietnamese trio.J Biosci. 2015 Mar;40(1):113-24. doi: 10.1007/s12038-015-9501-0. J Biosci. 2015. PMID: 25740146
-
KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing.Database (Oxford). 2020 Jan 1;2020:baz146. doi: 10.1093/database/baz146. Database (Oxford). 2020. PMID: 32133509 Free PMC article.
-
dbSNP in the detail and copy number complexities.Hum Mutat. 2010 Jan;31(1):2-4. doi: 10.1002/humu.21149. Hum Mutat. 2010. PMID: 20024941 Review.
Cited by
-
Genome-wide comparative analyses of GATA transcription factors among 19 Arabidopsis ecotype genomes: Intraspecific characteristics of GATA transcription factors.PLoS One. 2021 May 26;16(5):e0252181. doi: 10.1371/journal.pone.0252181. eCollection 2021. PLoS One. 2021. PMID: 34038437 Free PMC article.
-
Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.Nature. 2012 Jul 11;487(7406):190-5. doi: 10.1038/nature11236. Nature. 2012. PMID: 22785314 Free PMC article.
-
Whole-genome resequencing allows detection of many rare LINE-1 insertion alleles in humans.Genome Res. 2011 Jun;21(6):985-90. doi: 10.1101/gr.114777.110. Epub 2010 Oct 27. Genome Res. 2011. PMID: 20980553 Free PMC article.
-
Analysis of genomic variation in non-coding elements using population-scale sequencing data from the 1000 Genomes Project.Nucleic Acids Res. 2011 Sep 1;39(16):7058-76. doi: 10.1093/nar/gkr342. Epub 2011 May 19. Nucleic Acids Res. 2011. PMID: 21596777 Free PMC article.
-
Vertical lossless genomic data compression tools for assembled genomes: A systematic literature review.PLoS One. 2020 May 26;15(5):e0232942. doi: 10.1371/journal.pone.0232942. eCollection 2020. PLoS One. 2020. PMID: 32453750 Free PMC article.
References
-
- Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, et al. Sequence and organization of the human mitochondrial genome. Nature. 1981;290:457–465. - PubMed
-
- Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999;23:147. - PubMed
-
- Bowcock AM, Ruiz-Linares A, Tomfohrde J, Minch E, Kidd JR, Cavalli-Sforza LL. High resolution of human evolutionary trees with polymorphic microsatellites. Nature. 1994;368:455–457. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources