The Thai reference exome (T-REx) variant database
- PMID: 34496037
- DOI: 10.1111/cge.14060
The Thai reference exome (T-REx) variant database
Abstract
To maximize the potential of genomics in medicine, it is essential to establish databases of genomic variants for ethno-geographic groups that can be used for filtering and prioritizing candidate pathogenic variants. Populations with non-European ancestry are poorly represented among current genomic variant databases. Here, we report the first high-density survey of genomic variants for the Thai population, the Thai Reference Exome (T-REx) variant database. T-REx comprises exome sequencing data of 1092 unrelated Thai individuals. The targeted exome regions common among four capture platforms cover 30.04 Mbp on autosomes and chromosome X. 345 681 short variants (18.27% of which are novel) and 34 907 copy number variations were found. Principal component analysis on 38 469 single nucleotide variants present worldwide showed that the Thai population is most genetically similar to East and Southeast Asian populations. Moreover, unsupervised clustering revealed six Thai subpopulations consistent with the evidence of gene flow from neighboring populations. The prevalence of common pathogenic variants in T-REx was investigated in detail, which revealed subpopulation-specific patterns, in particular variants associated with erythrocyte disorders such as the HbE variant in HBB and the Viangchan variant in G6PD. T-REx serves as a pivotal addition to the current databases for genomic medicine.
Keywords: Thai; database; genomic medicine; population genetics; precision medicine; rare disease.
© 2021 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
References
REFERENCES
-
- Karczewski KJ, Francioli LC, Tiao G, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581(7809):434.
-
- Wall JD, Stawiski EW, Ratan A, et al. The GenomeAsia 100K project enables genetic discoveries across Asia. Nature. 2019;576(7785):106-111.
-
- Lee S, Seo J, Park J, et al. Korean variant archive (KOVA): a reference database of genetic variations in the Korean population. Sci Rep. 2017;7(1):4287.
-
- Kim J, Weber JA, Jho S, et al. KoVariome: Korean National Standard Reference Variome database of whole genomes with comprehensive SNV, indel, CNV, and SV analyses. Sci Rep. 2018;8(1):5677.
-
- Wu D, Dou J, Chai X, et al. Large-scale whole-genome sequencing of three diverse Asian populations in Singapore. Cell. 2019;179(3):736-749.e715.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
