Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 31;13(12):e0209603.
doi: 10.1371/journal.pone.0209603. eCollection 2018.

Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset

Affiliations

Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset

Mohamed N Saad et al. PLoS One. .

Abstract

Haplotype-based methods compete with "one-SNP-at-a-time" approaches on being preferred for association studies. Chromosome 6 contains most of the known genetic biomarkers for rheumatoid arthritis (RA) disease. Therefore, chromosome 6 serves as a benchmark for the haplotype methods testing. The aim of this study is to test the North American Rheumatoid Arthritis Consortium (NARAC) dataset to find out if haplotype block methods or single-locus approaches alone can sufficiently provide the significant single nucleotide polymorphisms (SNPs) associated with RA. In addition, could we be satisfied with only one method of the haplotype block methods for partitioning chromosome 6 of the NARAC dataset? In the NARAC dataset, chromosome 6 comprises 35,574 SNPs for 2,062 individuals (868 cases, 1,194 controls). Individual SNP approach and three haplotype block methods were applied to the NARAC dataset to identify the RA biomarkers. We employed three haplotype partitioning methods which are confidence interval test (CIT), four gamete test (FGT), and solid spine of linkage disequilibrium (SSLD). P-values after stringent Bonferroni correction for multiple testing were measured to assess the strength of association between the genetic variants and RA susceptibility. Moreover, the block size (in base pairs (bp) and number of SNPs included), number of blocks, percentage of uncovered SNPs by the block method, percentage of significant blocks from the total number of blocks, number of significant haplotypes and SNPs were used to compare among the three haplotype block methods. Individual SNP, CIT, FGT, and SSLD methods detected 432, 1,086, 1,099, and 1,322 associated SNPs, respectively. Each method identified significant SNPs that were not detected by any other method (Individual SNP: 12, FGT: 37, CIT: 55, and SSLD: 189 SNPs). 916 SNPs were discovered by all the three haplotype block methods. 367 SNPs were discovered by the haplotype block methods and the individual SNP approach. The P-values of these 367 SNPs were lower than those of the SNPs uniquely detected by only one method. The 367 SNPs detected by all the methods represent promising candidates for RA susceptibility. They should be further investigated for the European population. A hybrid technique including the four methods should be applied to detect the significant SNPs associated with RA for chromosome 6 of the NARAC dataset. Moreover, SSLD method may be preferred for its favored benefits in case of selecting only one method.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Experimental design of the proposed system.
Fig 2
Fig 2. Quantile–quantile (Q–Q) plots of the chromosome 6 association results for all the used methods.
Q-Q plot shows the observed distribution of–log10(P-values) on the Y-axis compared to the expected distribution of–log10(P-values) on the X-axis. The red line (X = Y) represents the null distribution. SNP refers to the individual SNP approach.
Fig 3
Fig 3. Manhattan plot for the associations between genotyped SNPs (chromosome 6 of the NARAC dataset) and RA susceptibility using individual SNP approach.
The upper part represents chromosome 6 ideogram and its genomic coordinates. The Y-axis shows–log10(P-values), which represent the strength of association. The larger the point and the higher the point on the scale, the more significant the association with RA susceptibility.
Fig 4
Fig 4. Comparison of the haplotype blocks obtained by the three methods.
Each circle conistituted a significant haplotype block associated with RA susceptibility. The plot area represented chromosome 6, then zooming in on the MHC region.

Similar articles

Cited by

References

    1. Saad M. N., Mabrouk M. S., Eldeib A. M. and Shaker O. G., " Identification of rheumatoid arthritis biomarkers based on single nucleotide polymorphisms and haplotype blocks: a systematic review and meta-analysis," J Adv Res, vol. 7, pp. 1–16, 2016. 10.1016/j.jare.2015.01.008 - DOI - PMC - PubMed
    1. Peise E., Fabregat-Traver D. and Bientinesi P., "High performance solutions for big-data GWAS," Parallel Computing, vol. 42, pp. 75–87, 2015.
    1. Clark A. G., "The role of haplotypes in candidate gene studies," Genet Epidemiol, vol. 27, pp. 321–33, December 2004. 10.1002/gepi.20025 - DOI - PubMed
    1. Su S. C., Kuo C. C. and Chen T., "Single nucleotide polymorphism data analysis—state-of-the-art review on this emerging field from a signal processing viewpoint," IEEE Signal Process Mag, vol. 24, pp. 75–82, 2007.
    1. Shim H., Chun H., Engelman C. D. and Payseur B. A., "Genome-wide association studies using single-nucleotide polymorphisms versus haplotypes: an empirical comparison with data from the North American Rheumatoid Arthritis Consortium," BMC Proc, vol. 3 Suppl 7, p. S35, 2009. - PMC - PubMed

Publication types