Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan 18:18:113-126.
doi: 10.1016/j.jare.2019.01.006. eCollection 2019 Jul.

Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset

Affiliations

Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset

Mohamed N Saad et al. J Adv Res. .

Abstract

The human genome, which includes thousands of genes, represents a big data challenge. Rheumatoid arthritis (RA) is a complex autoimmune disease with a genetic basis. Many single-nucleotide polymorphism (SNP) association methods partition a genome into haplotype blocks. The aim of this genome wide association study (GWAS) was to select the most appropriate haplotype block partitioning method for the North American Rheumatoid Arthritis Consortium (NARAC) dataset. The methods used for the NARAC dataset were the individual SNP approach and the following haplotype block methods: the four-gamete test (FGT), confidence interval test (CIT), and solid spine of linkage disequilibrium (SSLD). The measured parameters that reflect the strength of the association between the biomarker and RA were the P-value after Bonferroni correction and other parameters used to compare the output of each haplotype block method. This work presents a comparison among the individual SNP approach and the three haplotype block methods to select the method that can detect all the significant SNPs when applied alone. The GWAS results from the NARAC dataset obtained with the different methods are presented. The individual SNP, CIT, FGT, and SSLD methods detected 541, 1516, 1551, and 1831 RA-associated SNPs respectively, and the individual SNP, FGT, CIT, and SSLD methods detected 65, 156, 159, and 450 significant SNPs respectively, that were not detected by the other methods. Three hundred eighty-three SNPs were discovered by the haplotype block methods and the individual SNP approach, while 1021 SNPs were discovered by all three haplotype block methods. The 383 SNPs detected by all the methods are promising candidates for studying RA susceptibility. A hybrid technique involving all four methods should be applied to detect the significant SNPs associated with RA in the NARAC dataset, but the SSLD method may be preferred because of its advantages when only one method was used.

Keywords: Confidence interval test; Four-gamete test; Genome-wide association study; NARAC; Rheumatoid arthritis; Solid spine of linkage disequilibrium.

PubMed Disclaimer

Figures

None
Graphical abstract
Fig. 1
Fig. 1
Snapshot of the NARAC dataset showing 10 samples with their corresponding 3 SNPs. The first column represents the individuals’ IDs. The second column refers to the affection status (0: case, 1: control). The third column shows the sex (F: female, M: male). The next columns correspond to the SNPs, with the first row providing the SNP ID. In each SNP cell, two identical alleles represent a homozygote, whereas two different alleles represent a heterozygote.
Fig. 2
Fig. 2
Summary of the proposed system for the NARAC dataset.
Fig. 3
Fig. 3
Comparison of the RA-associated results obtained by the three haplotype block partitioning methods. (a) The total number of significant blocks for each Chr. (b) The total number of associated SNPs for each Chr. (c) The total significant blocks size in bp for each Chr.
Fig. 4
Fig. 4
Number of RA biomarkers detected by each method – “all” biomarkers detected by the method or detected “only” by one method.
Fig. 5
Fig. 5
Manhattan plot showing the associations between the whole NARAC SNPs and RA susceptibility using the individual SNP approach. The genes with P-values lower than the genome-wide significance threshold are shown above the plot area.
Fig. 6
Fig. 6
Comparison for the CIT and SSLD methods on the same significant haplotype block in the PHF19-TRAF1-C5 region. (a) LD plot showing CIT block comprising eight biomarkers. (b) LD plot for SSLD block including twelve biomarkers.
Fig. 7
Fig. 7
LD plot for the TBX1 region showing a biomarker in this study (rs1005133) and a previously detected biomarker (rs4819522).

Similar articles

Cited by

References

    1. Saad M.N., Mabrouk M.S., Eldeib A.M., Shaker O.G. Identification of rheumatoid arthritis biomarkers based on single nucleotide polymorphisms and haplotype blocks: a systematic review and meta-analysis. J Adv Res. 2016;7(1):1–16. - PMC - PubMed
    1. Saad M.N., Mabrouk M.S., Eldeib A.M., Shaker O.G. 7th Cairo international biomedical engineering conference. IEEE; Cairo, Egypt: 2014. Vitamin D receptor gene polymorphisms in rheumatoid arthritis patients associating osteoporosis; pp. 75–78.
    1. Saad M.N., Mabrouk M.S., Eldeib A.M., Shaker O.G. Effect of MTHFR, TGFβ1, and TNFB polymorphisms on osteoporosis in rheumatoid arthritis patients. Gene. 2015;568(2):124–128. - PubMed
    1. Saad M.N., Mabrouk M.S., Eldeib A.M., Shaker O.G. Genetic case-control study for eight polymorphisms associated with rheumatoid arthritis. PLoS One. 2015;10(7):e0131960. - PMC - PubMed
    1. Alonso N., Lucas G., Hysi P. Big data challenges in bone research: genome-wide association studies and next-generation sequencing. BoneKEy Rep. 2015;4:635. - PMC - PubMed

LinkOut - more resources