Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb;28(1):82-90.
doi: 10.1177/0963721418807729. Epub 2019 Jan 9.

Phenotypic Annotation: Using Polygenic Scores to Translate Discoveries From Genome-Wide Association Studies From the Top Down

Affiliations

Phenotypic Annotation: Using Polygenic Scores to Translate Discoveries From Genome-Wide Association Studies From the Top Down

Daniel W Belsky et al. Curr Dir Psychol Sci. 2019 Feb.

Abstract

Genome-wide association studies (GWASs) have identified specific genetic variants associated with complex human traits and behaviors, such as educational attainment, mental disorders, and personality. However, small effect sizes for individual variants, uncertainty regarding the biological function of discovered genotypes, and potential "outside-the-skin" environmental mechanisms leave a translational gulf between GWAS results and scientific understanding that will improve human health and well-being. We propose a set of social, behavioral, and brain-science research activities that map discovered genotypes to neural, developmental, and social mechanisms and call this research program phenotypic annotation. Phenotypic annotation involves (a) elaborating the nomological network surrounding discovered genotypes, (b) shifting focus from individual genes to whole genomes, and (c) testing how discovered genotypes affect life-span development. Phenotypic-annotation research is already advancing the understanding of GWAS discoveries for educational attainment and schizophrenia. We review examples and discuss methodological considerations for psychologists taking up the phenotypic-annotation approach.

Keywords: GWAS; development; genetics; gene–environment correlation; polygenic scores.

PubMed Disclaimer

Conflict of interest statement

Declaration of Conflicting Interests The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.

Figures

Fig. 1.
Fig. 1.
Polygenic scoring. Polygenic scores aggregate information from up to millions of single-nucleotide polymorphisms (SNPs) across the genome into a single composite index summarizing genome-wide genetic influence on a target phenotype. The procedure of computing a polygenic score is conducted using whole-genome SNP data on a research participant and scoring information from some external database, typically a published genome-wide association study (GWAS). In the first step, the participant’s SNP genotypes are assigned weights that indicate the direction and magnitude of that SNP’s association with a target phenotype. Most often these weights are the effect sizes, b^j, estimated in a GWAS that did not include the research participant. Next, the number of phenotype-associated alleles (0, 1, or 2) at each SNP (j) is counted, and the count is multiplied by the weight, b^j. Finally, the weighted count is summed across SNPs to compute the participant’s polygenic score. The resulting distribution of polygenic scores across participants is normal. Polygenic-score analysis often involves two additional features. First, because polygenic scores draw information from many sites across the genome, they are sensitive to bias arising from allele-frequency differences between populations of different ancestry, called population stratification (Martin et al., 2017). Polygenic-score analysis is therefore typically conducted within populations that share genetic ancestry (e.g., Europeans), and the analysis usually includes covariate adjustment for principal components estimated from genome-wide SNP data to account for any residual population stratification (Price et al., 2006). Second, polygenic-score analysis is sometimes restricted to a subset of SNPs. This includes procedures to select SNPs that are statistically independent of one another and procedures to select SNPs that meet other criteria, such as having p values in the discovery GWAS that fall below a certain threshold. To date, evidence suggests that polygenic scores are best constructed using data from all available SNPs (Dudbridge & Newcombe, 2016; Ware et al., 2017).

References

    1. Bansal V, Mitjans M, Burik CAP, Linnér RK, Okbay A, Rietveld CA, … Koellinger PD (2018). Genome-wide association study results for educational attainment aid in identifying genetic heterogeneity of schizophrenia. Nature Communications, 9(1), Article 3078. doi:10.1038/s41467-018-05510-z - DOI - PMC - PubMed
    1. Barcellos SH, Carvalho LS, & Turley P. (2018). Education can reduce health disparities related to genetic risk of obesity: Evidence from a British reform. BioRxiv. doi:10.1101/260463 - DOI - PMC - PubMed
    1. Barth D, Papageorge NW, & Thom K. (2017). Genetic ability, wealth, and financial decision-making (IZA Discussion Paper No. 10567). Rochester, NY: Social Science Research Network. Retrieved from SSRN website: https://papers.ssrn.com/abstract=2923653
    1. Bates TC, Maher BS, Medland SE, McAloney K, Wright MJ, Hansell NK, … Gillespie NA (2018). The nature of nurture: Using a virtual-parent design to test parenting effects on children’s educational attainment in genotyped families. Twin Research and Human Genetics, 21, 73–83. doi:10.1017/thg.2018.11 - DOI - PubMed
    1. Belsky DW, Caspi A, Arseneault L, Corcoran D, Domingue BW, Harris KM, … Odgers C. (2018). Genetics and the geography of health, behavior, and attainment. BioRxiv. doi:10.1101/376897 - DOI - PMC - PubMed

Recommended Reading

    1. Belsky DW, Moffitt TE, Corcoran DL, Domingue B, Harrington H, Hogan S, … Caspi A. (2016). (See References). An example of comprehensive phenotypic-annotation analyses applied to developmental data.
    1. Plomin R, Haworth CMA, & Davis OSP (2009). (See References). Contains high-quality figures that illustrate polygenic influence on traits.
    1. Scarr S, & McCartney K. (1983). (See References). A classic theoretical article on the importance of the environment for understanding genetic effects.
    1. Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, & Yang J. (2017). (See References). A general summary of recent discoveries in genome-wide association studies.

LinkOut - more resources