Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep 1;33(17):2776-2778.
doi: 10.1093/bioinformatics/btx299.

FlashPCA2: principal component analysis of Biobank-scale genotype datasets

Affiliations

FlashPCA2: principal component analysis of Biobank-scale genotype datasets

Gad Abraham et al. Bioinformatics. .

Abstract

Motivation: Principal component analysis (PCA) is a crucial step in quality control of genomic data and a common approach for understanding population genetic structure. With the advent of large genotyping studies involving hundreds of thousands of individuals, standard approaches are no longer feasible. However, when the full decomposition is not required, substantial computational savings can be made.

Results: We present FlashPCA2, a tool that can perform partial PCA on 1 million individuals faster than competing approaches, while requiring substantially less memory.

Availability and implementation: https://github.com/gabraham/flashpca .

Contact: gad.abraham@unimelb.edu.au.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

LinkOut - more resources