Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Mar;77(3):283-287.
doi: 10.1016/j.humimm.2015.12.006. Epub 2015 Dec 18.

Bridging ImmunoGenomic Data Analysis Workflow Gaps (BIGDAWG): An integrated case-control analysis pipeline

Affiliations

Bridging ImmunoGenomic Data Analysis Workflow Gaps (BIGDAWG): An integrated case-control analysis pipeline

Derek J Pappas et al. Hum Immunol. 2016 Mar.

Abstract

Bridging ImmunoGenomic Data-Analysis Workflow Gaps (BIGDAWG) is an integrated data-analysis pipeline designed for the standardized analysis of highly-polymorphic genetic data, specifically for the HLA and KIR genetic systems. Most modern genetic analysis programs are designed for the analysis of single nucleotide polymorphisms, but the highly polymorphic nature of HLA and KIR data require specialized methods of data analysis. BIGDAWG performs case-control data analyses of highly polymorphic genotype data characteristic of the HLA and KIR loci. BIGDAWG performs tests for Hardy-Weinberg equilibrium, calculates allele frequencies and bins low-frequency alleles for k×2 and 2×2 chi-squared tests, and calculates odds ratios, confidence intervals and p-values for each allele. When multi-locus genotype data are available, BIGDAWG estimates user-specified haplotypes and performs the same binning and statistical calculations for each haplotype. For the HLA loci, BIGDAWG performs the same analyses at the individual amino-acid level. Finally, BIGDAWG generates figures and tables for each of these comparisons. BIGDAWG obviates the error-prone reformatting needed to traffic data between multiple programs, and streamlines and standardizes the data-analysis process for case-control studies of highly polymorphic data. BIGDAWG has been implemented as the bigdawg R package and as a free web application at bigdawg.immunogenomics.org.

Keywords: Amino-acid analysis; BIGDAWG; Case-control analysis; HLA KIR data analysis; Haplotype analysis; Hardy–Weinberg testing; R package; Web app.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Summary Statistics and Hardy-Weinberg Equilibrium Analysis
Sig (significance) column. * indicates a significant p-value. These p-values have not been corrected for multiple comparisons.
Figure 2
Figure 2. Summarized Association Testing Results
Sig (significance) column. * indicates a significant p-value. These p-values have not been corrected for multiple comparisons. The Amino Acid Analysis results have been shorted for publication.

Similar articles

Cited by

References

    1. Hollenbach JA, Mack SJ, Gourraud PA, Single RM, Maiers M, Middleton D, Thomson G, Marsh SG, Varney MD. A community standard for immunogenomic data reporting and analysis: proposal for a STrengthening the REporting of Immunogenomic Studies statement. Tissue Antigens. 2011;78(5):333. - PMC - PubMed
    1. Lancaster AK, Single RM, Solberg OD, Nelson MP, Thomson G. PyPop update--a software pipeline for large-scale multilocus population genomics. Tissue Antigens. 2007;69(Suppl 1):192. - PMC - PubMed
    1. Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564. - PubMed
    1. Mack SJ, Gourraud PA, Single RM, Thomson G, Hollenbach JA. Analytical methods for immunogenetic population data. Methods Mol Biol. 2012;882:215. - PMC - PubMed
    1. Gourraud PA, Hollenbach JA, Barnetche T, Single RM, Mack SJ. Standard methods for the management of immunogenetic data. Methods Mol Biol. 2012;882:197. - PMC - PubMed

Publication types