Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Dec 2:2023.01.30.523509.
doi: 10.1101/2023.01.30.523509.

Normative Modeling of Brain Morphometry Across the Lifespan Using CentileBrain: Algorithm Benchmarking and Model Optimization

Ruiyang Ge  1 Yuetong Yu  1 Yi Xuan Qi  1 Yunan Vera Fan  1 Shiyu Chen  1 Chuntong Gao  1 Shalaila S Haas  2 Amirhossein Modabbernia  2 Faye New  2 Ingrid Agartz  3   4   5 Philip Asherson  6 Rosa Ayesa-Arriola  7 Nerisa Banaj  8 Tobias Banaschewski  9 Sarah Baumeister  9 Alessandro Bertolino  10 Dorret I Boomsma  11 Stefan Borgwardt  12 Josiane Bourque  13 Daniel Brandeis  9   14 Alan Breier  15 Henry Brodaty  16 Rachel M Brouwer  17   18 Randy Buckner  19   20 Jan K Buitelaar  21 Dara M Cannon  22 Xavier Caseras  23 Simon Cervenka  5   24 Patricia J Conrod  25 Benedicto Crespo-Facorro  26   27 Fabrice Crivello  28 Eveline A Crone  29   30 Liewe de Haan  31 Greig I de Zubicaray  32 Annabella Di Giorgio  33 Susanne Erk  34 Simon E Fisher  35   36 Barbara Franke  37   38 Thomas Frodl  39 David C Glahn  40 Dominik Grotegerd  41 Oliver Gruber  42 Patricia Gruner  43 Raquel E Gur  13 Ruben C Gur  13 Ben J Harrison  44 Sean N Hatton  45 Ian Hickie  46 Fleur M Howells  47 Hilleke E Hulshoff Pol  17   48 Chaim Huyser  49 Terry L Jernigan  50 Jiyang Jiang  16 John A Joska  51 René S Kahn  2 Andrew J Kalnin  52 Nicole A Kochan  16 Sanne Koops  53 Jonna Kuntsi  6 Jim Lagopoulos  54 Luisa Lazaro  55 Irina S Lebedeva  56 Christine Lochner  57 Nicholas G Martin  58 Bernard Mazoyer  28 Brenna C McDonald  59 Colm McDonald  60 Katie L McMahon  61 Tomohiro Nakao  62 Lars Nyberg  63 Fabrizio Piras  8 Maria J Portella  27   64 Jiang Qiu  65   66   67 Joshua L Roffman  68 Perminder S Sachdev  16 Nicole Sanford  1 Theodore D Satterthwaite  13 Andrew J Saykin  59 Gunter Schumann  69 Carl M Sellgren  5   70 Kang Sim  71 Jordan W Smoller  72 Jair Soares  73 Iris E Sommer  74 Gianfranco Spalletta  8 Dan J Stein  75 Christian K Tamnes  3   4   76 Sophia I Thomopolous  77 Alexander S Tomyshev  56 Diana Tordesillas-Gutiérrez  78 Julian N Trollor  16   79 Dennis van 't Ent  11 Odile A van den Heuvel  80   81 Theo Gm van Erp  82 Neeltje Em van Haren  83 Daniela Vecchio  8 Dick J Veltman  80 Henrik Walter  34 Yang Wang  84 Bernd Weber  85 Dongtao Wei  65   66 Wei Wen  16 Lars T Westlye  86 Lara M Wierenga  30 Steven Cr Williams  87 Margaret J Wright  88 Sarah Medland  88 Mon-Ju Wu  89 Kevin Yu  1 Neda Jahanshad  77 Paul M Thompson  77 Sophia Frangou  1   2
Affiliations

Normative Modeling of Brain Morphometry Across the Lifespan Using CentileBrain: Algorithm Benchmarking and Model Optimization

Ruiyang Ge et al. bioRxiv. .

Update in

Abstract

We present an empirically benchmarked framework for sex-specific normative modeling of brain morphometry that can inform about the biological and behavioral significance of deviations from typical age-related neuroanatomical changes and support future study designs. This framework was developed using regional morphometric data from 37,407 healthy individuals (53% female; aged 3-90 years) following a comparative evaluation of eight algorithms and multiple covariate combinations pertaining to image acquisition and quality, parcellation software versions, global neuroimaging measures, and longitudinal stability. The Multivariate Factorial Polynomial Regression (MFPR) emerged as the preferred algorithm optimized using nonlinear polynomials for age and linear effects of global measures as covariates. The MFPR models showed excellent accuracy across the lifespan and within distinct age-bins, and longitudinal stability over a 2-year period. The performance of all MFPR models plateaued at sample sizes exceeding 3,000 study participants. The model and scripts described here are freely available through CentileBrain (https://centilebrain.org/).

PubMed Disclaimer

Conflict of interest statement

Declaration of interests SSH is supported by NIH National Institute of Mental Health (T32MH122394), and received a travel award from the Society of Biological Psychiatry to attend the annual meeting in 2023. HB declares an institutional grant from the National Health and Medical Research Council; has received compensation for being on an advisory board or a consultant to Biogen, Eisai, Eli Lilly, Roche, and Skin2Neuron; payment for being on the Cranbrook Care Medical Advisory Board, and honoraria for being on the Montefiore Homes Clinical Advisory Board. RMB and HEHP declare partial funding through the Geestkracht programme of the Dutch Health Research Council (Zon-Mw, grant No 10-000-1001), and matching funds from participating pharmaceutical companies (Lundbeck, AstraZeneca, Eli Lilly, Janssen Cilag) and universities and mental health care organizations (Amsterdam: Academic Psychiatric Centre of the Academic Medical Center and the mental health institutions: GGZ Ingeest, Arkin, Dijk en Duin, GGZ Rivierduinen, Erasmus Medical Centre, GGZ Noord Holland Noord. Groningen: University Medical Center Groningen and the mental health institutions: Lentis, GGZ Friesland, GGZ Drenthe, Dimence, Mediant, GGNet Warnsveld, Yulius Dordrecht and Parnassia psycho-medical center The Hague. Maastricht: Maastricht University Medical Centre and the mental health institutions: GGzE, GGZ Breburg, GGZ Oost-Brabant, Vincent van Gogh voor Geestelijke Gezondheid, Mondriaan, Virenze riagg, Zuyderland GGZ, MET ggz, Universitair Centrum Sint-Jozef Kortenberg, CAPRI University of Antwerp, PC Ziekeren Sint-Truiden, PZ Sancta Maria Sint-Truiden, GGZ Overpelt, OPZ Rekem. Utrecht: University Medical Center Utrecht and the mental health institutions Altrecht, GGZ Centraal and Delta), Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO 51.02.061 to H.H., NWO 51.02.062 to D. B., NWO–NIHC Programs of excellence 433-09-220 to H.H., NWO-MagW 480-04-004 to D. B., and NWO/SPI 56-464-14192 to D.B.); FP7 Ideas: European Research Council (ERC-230374 to D. B.); and Universiteit Utrecht (High Potential Grant to H. H.). RB declares funding by NIH National Institute on Aging (R01AG067420); compensation for being on the scientific advisory board from Alkermes and Cognito Therapeutics with no conflict to the present work; honoraria from academic institutions for talks all under $1000 and $1000 for speaking at MGH/HMS course; travel fees for services to attend the annual meeting from the Simons Foundation; serves as a Director on the Simons Foundation collaborative initiative on aging (SCPAB); is a paid scientific advisory board member for philanthropic grants for The Foundation for OCD Research and the Klarman Family Foundation. BF has received educational speaking fees from Medice. DG reports funding from the NIH. UD is funded through the German Research Foundation (DFG; DA 1151/9- 1, DA 1151/10- 1, DA 1151/11- 1). GS declares funding from the European Commission, DFG, and NSFC. CKT has received grants from the Research Council of Norway and the Norwegian Regional Health Authority, unrelated to the current work. HW reports funding from the German Research Foundation (WA 1539/11-1). NJ reports funding from the NIH and compensation from the International Neuropsychological Society. PT declares a grant from the NIH and travel funded by NIH grants. All other authors declare no competing interests.

Figures

Figure 1.
Figure 1.. Flowchart of normative model optimization
(1) The study sample was stratified by sex and then split into training (80%) and testing (20%) datasets, followed by outlier removal, and mean-centering; (2) Normative models were generated using eight different algorithms and compared in terms of accuracy and computational efficiency; (3) Explanatory variables were added to identify the appropriate combination for optimal model performance. ICV=intracranial volume; IQR=interquartile range
Figure 2.
Figure 2.. Illustrative examples of comparative algorithm performance
Algorithm performance for each regional morphometric measure was assessed separately in males and females using the mean absolute error (MAE), the root mean square error (RMSE), explained variance (EV), and the central processing unit (CPU) time. The MAE, RMSE, EV, and CPU times of the models for left thalamic volume (left panel), the left medial orbitofrontal cortical thickness (middle panel) and surface area (right panel) as exemplars here for females and in appendix 1, p 4, figure S3 for males. The pattern identified was the same across all region-specific models and in both sexes (appendix 1, pp 5–6, figures S4-S5). HBR=Hierarchical Bayesian Regression; OLSR=Ordinary Least Squares Regression; BLR=Bayesian Linear Regression; GAMLSS=Generalized Additive Models for Location, Scale and Shape; LMS=Lambda (λ), Mu (μ), Sigma (σ) Quantile Regression; GPR=Gaussian Process Regression; WBLR=Warped Bayesian Linear Regression; MFPR=Multivariate Fractional Polynomial Regression.
Figure 3.
Figure 3.. Illustrative examples of the performance of MFPR-derived models as a function of explanatory variables
For each regional morphometric measure, sex-specific models derived from all algorithms were trained and tested using nine different covariate combinations that included effects of age, FreeSurfer version (FS), Euler Number, scanner vendor, intracranial volume (ICV) and global estimates of mean cortical thickness or area. The mean absolute error (MAE) and root mean square error (RMSE) of models for left thalamic volume (left panel), the left medial orbitofrontal cortical thickness (middle panel), and surface area (right panel) derived from Multivariate Fractional Polynomial Regression (MFPR) for females are presented as exemplars; the optimal variable combination is marked with a dashed frame. The corresponding data for males are presented in appendix 1 (p 7, figure S6). The data for other algorithms are shown in appendix 1 (pp 8–12, figures S7-S11 and S2). In both sexes, the pattern identified was identical for all region-specific models.
Figure 4.
Figure 4.. Illustrative examples of the comparative performance of OLSR, BLR, HBR, GPR, GAMLSS, WBLR, and MFPR-derived optimised models
Region-specific models with the optimised covariate combination were estimated in males and females separately using Ordinary Least Squares Regression (OLSR), Bayesian Linear Regression (BLR), Hierarchical Bayesian Regression (HBR), Gaussian Process Regression (GPR), Generalized Additive Models for Location, Scale, and Shape (GAMLSS), Warped Bayesian Linear Regression (WBLR), and Multivariable Fractional Polynomial Regression (MFPR). Model performance was assessed in terms of mean absolute error (MAE), root mean square error (RMSE), and central processing unit (CPU). The MAE, RSME, and CPU time of the models for left thalamic volume (left panel), the left medial orbitofrontal cortical thickness (middle panel), and surface area (right panel) in females are presented as exemplars and in appendix 1 (p 10, figure S9) for males.
Figure 5.
Figure 5.. Performance of region-specific MFPR-derived models as a function of sample size
Models for each regional morphometric measure were estimated in random sex-specific subsets of 200 to 15,000 participants, in increments of 200, generated from the study sample. Each line represents the values of the mean absolute error (MAE), or root mean square error (RMSE) derived from the optimized Multivariate Fractional Polynomial Regression (MFPR) models of all regional morphometric measure as a function of sample size; shadowed area represents the standard deviation. The pattern identified was identical in both sexes. The data for females are shown here and for males in appendix 1 (p 13, figure S12).
Figure 6.
Figure 6.. Performance of region-specific models in distinct age bins
Sex- and region-specific models of all morphometric measures for different age bins were estimated by partitioning the sex-specific training and testing subsets of the study sample into nine age bins (i.e., age≤10 years; 10
Figure 7.
Figure 7.. Stability of the normative deviation scores (Z-scores) in longitudinal neuroimaging data
We illustrate the stability of the optimized MFPR-derived models over an average interval of two years in data from the SLIM and QTAB samples using the left thalamic volume (left panel), the left medial orbitofrontal cortical thickness (middle panel), and surface area (right panel) as exemplars. Within each panel, the left-hand figure shows the Z-scores of each participant at baseline and follow-up and the right-hand figure shows the distribution of the mean absolute error (MAE) and root mean squared error (RMSE) at baseline and follow-up. SLIM=Southwest Longitudinal Imaging Multimodal Study; QTAB= Queensland Twin Adolescent Brain Study.
Figure 8.
Figure 8.. Accuracy of diagnostic classification and accuracy of psychotic symptom prediction using brain regional normative deviation scores or observed neuromorphometric data
Left panel: Diagnostic classification accuracy in the Human Connectome Project -Early Psychosis (HCP-EP) sample. Receiver Operating Characteristic (ROC) curves of the models distinguishing patients from controls using the observed regional neuromorphometric measures (blue curve) or the deviation Z-scores from the normative model (orange curve); Middle panel: the area under the curve (AUC) difference between a support vector machine classifier using the observed regional neuromorphometric measures and another using regional normative deviation scores (Z-scores) derived from the optimised multivariate fractional polynomial regression (MFPR) model was examined through 1000 permutations. The AUC difference is marked by a vertical dotted line. Right panel: Predictive Accuracy of Psychotic Symptoms in the HCP-EP sample. The mean absolute error (MAE) difference between a ridge regression using the observed regional neuromorphometric measures and another using Z-scores derived from the optimised MFPR model was examined through 1000 permutations. The MAE difference is marked by a vertical dotted line. Information on other models in appendix 1, p 22, figure S19

References

    1. Bethlehem RAI, Seidlitz J, White SR, et al. Brain charts for the human lifespan. Nature 2022;604: 525–33. - PMC - PubMed
    1. Dima D, Modabbernia A, Papachristou E, et al. Subcortical volumes across the lifespan: data from 18,605 healthy individuals aged 3-90 years. Hum Brain Mapp 2022; 43: 452–69. - PMC - PubMed
    1. Frangou S, Modabbernia A, Williams SCR, et al. Cortical thickness across the lifespan: data from 17,075 healthy individuals aged 3-90 years. Hum Brain Mapp 2022; 43: 431–51. - PMC - PubMed
    1. Potvin O, Dieumegarde L, Duchesne S, et al. NOMIS: quantifying morphometric deviation from normality over the lifetime in the adult human brain. bioRxiv 2022; published online Feb 23. doi: 10.1101/2021.01.25.428063 (preprint). - DOI
    1. Villalon-Reina JE, Moreau CA, Nir TM, et al. Multi-site normative modeling of diffusion tensor imaging metrics using hierarchical Bayesian regression. In: Wang L, Dou Q, Fletcher P, Speidel S, Li S, editors. Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. MICCAI 2022 Lecture Notes in Computer Science. Springer, Cham: 2022; 13431: 207–17. - PMC - PubMed

Publication types