Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 11:4:635.
doi: 10.1038/bonekey.2015.2. eCollection 2015.

Big data challenges in bone research: genome-wide association studies and next-generation sequencing

Affiliations

Big data challenges in bone research: genome-wide association studies and next-generation sequencing

Nerea Alonso et al. Bonekey Rep. .

Abstract

Genome-wide association studies (GWAS) have been developed as a practical method to identify genetic loci associated with disease by scanning multiple markers across the genome. Significant advances in the genetics of complex diseases have been made owing to advances in genotyping technologies, the progress of projects such as HapMap and 1000G and the emergence of genetics as a collaborative discipline. Because of its great potential to be used in parallel by multiple collaborators, it is important to adhere to strict protocols assuring data quality and analyses. Quality control analyses must be applied to each sample and each single-nucleotide polymorphism (SNP). The software package PLINK is capable of performing the whole range of necessary quality control tests. Genotype imputation has also been developed to substantially increase the power of GWAS methodology. Imputation permits the investigation of associations at genetic markers that are not directly genotyped. Results of individual GWAS reports can be combined through meta-analysis. Finally, next-generation sequencing (NGS) has gained popularity in recent years through its capacity to analyse a much greater number of markers across the genome. Although NGS platforms are capable of examining a higher number of SNPs compared with GWA studies, the results obtained by NGS require careful interpretation, as their biological correlation is incompletely understood. In this article, we will discuss the basic features of such protocols.

PubMed Disclaimer

Conflict of interest statement

Dr Gavin Lucas is a partner in Clear Genetics. Dr Nerea Alonso and Dr Pirro Hysi declare no potential conflict of interest.

Figures

Figure 1
Figure 1
Flowchart showing the different steps to take into account when performing the quality control testing on a GWAS.
Figure 2
Figure 2
Example of scattered plot in SPSS (IBM Corp., IBM SPSS Statistics for Windows, Armonk, NY, USA) for the PC1 versus PC2, showing some outliers from the Caucasian population (blue dots=CEU, green dots=CHB+JPT, cream dots=YOR, purple dots=study samples).
Figure 3
Figure 3
Data analysis flowchart.
Figure 4
Figure 4
Interface of BCSNPmax software for imputation step, showing the different parameters available for the analysis.
Figure 5
Figure 5
LD blocks for the selected area (TNFRSF11A as example) displaying results for (a) D′ and (b) r2 figures.

Similar articles

Cited by

References

    1. Zeggini E, Ioannidis JP. Meta-analysis in genome-wide association studies. Pharmacogenomics 2009;10:191–201. - PMC - PubMed
    1. Olivier M. A haplotype map of the human genome. Physiol Genomics 2003;13:3–9. - PubMed
    1. International HapMap Consortium. A haplotype map of the human genome. Nature 2005;437:1299–1320. - PMC - PubMed
    1. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, Gibbs RA et al.. A second generation human haplotype map of over 3.1 million SNPs. Nature 2007;449:851–861. - PMC - PubMed
    1. Barsh GS, Copenhaver GP, Gibson G, Williams SM. Guidelines for genome-wide association studies. PLoS Genet 2012;8:e1002812. - PMC - PubMed

LinkOut - more resources