Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 May;36(4):333-9.
doi: 10.1002/gepi.21625. Epub 2012 Apr 16.

On optimal gene-based analysis of genome scans

Affiliations

On optimal gene-based analysis of genome scans

Silviu-Alin Bacanu. Genet Epidemiol. 2012 May.

Abstract

Univariate analysis of markers has modest power when there are multiple causal variants within a gene. Under this scenario, combining the effects of all variants from a gene in a gene-wide statistic is thought to increase power. However, it is not really clear (1) what is the performance of most commonly used gene-wide methods for whole genome scans and (2) how scalable these methods are for more computationally intensive analyses, e.g. analysis of genome-wide sequence data. We attempt to answer these questions by using realistic simulations to assess the performance of a range of gene-based methods: (1) commonly used, e.g. VEGAS and GATES; (2) less commonly used, e.g. Simes, adaptive sum (aSUM), and kernel methods; and (3) a combination of univariate and multivariate tests we proposed for the analysis of markers in linkage disequilibrium. Simes is the fastest method and has good power for single causal variant models. aSUM method has good power for multiple causal variant models, especially at lower gene lengths. Our proposed statistic yields good power for all causal models. Given the extreme data volumes coming from sequencing studies, we recommend a two step analysis of genome scans. The initial step uses the very fast Simes procedure to flag possibly interesting genes. The second step refines interesting signals by using more computationally intensive methods, e.g. (1) aSUM for shorter and (2) VEGAS for larger gene lengths. Alternatively, genome scans can be analyzed using only our proposed method while sacrificing only a modest amount of power.

PubMed Disclaimer

LinkOut - more resources