Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar;31(3):321-328.
doi: 10.1038/s41431-022-01201-y. Epub 2022 Nov 7.

Genetic prediction of male pattern baldness based on large independent datasets

Affiliations

Genetic prediction of male pattern baldness based on large independent datasets

Yan Chen et al. Eur J Hum Genet. 2023 Mar.

Abstract

Genetic prediction of male pattern baldness (MPB) is important in science and society. Previous genetic MPB prediction models were limited by sparse marker coverage, small sample size, and/or data dependency in the different analytical steps. Here, we present novel models for genetic prediction of MPB based on a large set of markers and large independent subsample sets drawn among 187,435 European subjects. We selected 117 SNP predictors within 85 distinct loci from a list of 270 previously MPB-associated SNPs in 55,573 males of the UK Biobank Study (UKBB). Based on these 117 SNPs with and without age as additional predictor, we trained, by use of different methods, prediction models in a non-overlapping subset of 104,694 UKBB males and tested them in a non-overlapping subset of 26,177 UKBB males. Estimates of prediction accuracy were similar between methods with AUC ranges of 0.725-0.728 for severe, 0.631-0.635 for moderate, 0.598-0.602 for slight, and 0.708-0.711 for no hair loss with age, and slightly lower without, while prediction of any versus no hair loss gave 0.690-0.711 with age and slightly lower without. External validation in an early-onset enriched MPB dataset from the Bonn Study (N = 991) showed improved prediction accuracy without considering age such as AUC of 0.830 for no vs. any hair loss. Because of the large number of markers and the large independent datasets used for the different analytical steps, the newly presented genetic prediction models are the most reliable ones currently available for MPB or any other human appearance trait.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Relationship between the number of SNP predictors and the prediction accuracy expressed as AUC and AIC for 4-category MPB prediction in the marker ascertainment dataset of 55,573 UKBB males.
This dataset includes the 52,874 UKBB men previously applied for discovering the MPB-association of the 270 SNPs [8] used here for feature selection. Based on this dataset, a MLR model was fit in a randomly selected 80% prediction model training subset and applied in the remaining 20% prediction model testing subset with 100 replicates. The estimated mean AUCs are depicted as coloured line and the 5–95% boundary as respective light colour shade for the four predicted MPB categories. The dashed line marks the cut-off at 117 SNPs that were used as best-fit prediction marker set in all subsequent prediction analyses.
Fig. 2
Fig. 2. Individual contribution of the ascertained 117 SNP predictors and age on the prediction accuracy expressed as AUC for the 4-category MLR and the 2-category BLR MPB prediction models obtained from the model testing dataset of 26,177 UKBB men.
These prediction models were built in the prediction model training dataset of 104,694 UKKBB men.
Fig. 3
Fig. 3. Histograms of predicted probability overlaid with percentage of MPB categories (including slight, moderate and severe hair loss) in each probability bin.
A The probability was derived from MPB prediction model with 117 SNPs and age as predictors in the prediction model testing dataset of 26,177 UKBB men. These prediction results were practically informative for about 26.2% of the tested males (0.4% <0.2 and 25.8% >0.8). B The probability was derived from MPB prediction model with 107 SNPs as predictors without age in the external validation dataset of 991 men from the Bonn Study. These prediction results were practically informative for about 41.8% of the tested males (0.2% <0.2 and 41.6% >0.8). These 117-SNP (A) and the 107-SNP (B) MPB prediction models were built in the prediction model training dataset of 104,694 UKBB men.

References

    1. Lolli F, Pallotti F, Rossi A, Fortuna MC, Caro G, Lenzi A, et al. Androgenetic alopecia: a review. Endocrine. 2017;57:9–17. doi: 10.1007/s12020-017-1280-y. - DOI - PubMed
    1. Yap CX, Sidorenko J, Wu Y, Kemper KE, Yang J, Wray NR, et al. Dissection of genetic variation and evidence for pleiotropy in male pattern baldness. Nat Commun. 2018;9:5407. doi: 10.1038/s41467-018-07862-y. - DOI - PMC - PubMed
    1. King TE, Fortes GG, Balaresque P, Thomas MG, Balding D, Delser PM, et al. Identification of the remains of King Richard III. Nat Commun. 2014;5:5631. doi: 10.1038/ncomms6631. - DOI - PMC - PubMed
    1. Walsh S, Liu F, Wollstein A, Kovatsi L, Ralf A, Kosiniak-Kamysz A, et al. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci Int Genet. 2013;7:98–115. doi: 10.1016/j.fsigen.2012.07.005. - DOI - PubMed
    1. Wilde S, Timpson A, Kirsanow K, Kaiser E, Kayser M, Unterlander M, et al. Direct evidence for positive selection of skin, hair, and eye pigmentation in Europeans during the last 5,000 y. Proc Natl Acad Sci USA. 2014;111:4832–7. doi: 10.1073/pnas.1316513111. - DOI - PMC - PubMed

Publication types