Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep:80:102118.
doi: 10.1016/j.infbeh.2025.102118. Epub 2025 Aug 6.

Development and validation of the NIH Baby Toolbox® Executive Function and Memory measures

Affiliations
Free article

Development and validation of the NIH Baby Toolbox® Executive Function and Memory measures

Rachel M Flynn et al. Infant Behav Dev. 2025 Sep.
Free article

Abstract

Existing standardized assessments of Executive Function and Memory (EF-Mem) for infants and toddlers tend to be limited in scope, can be burdensome to administer, and/or do not often utilize modern technology. Here, we describe the development and validation of the novel iPad-based EF-Mem measures in the NIH Baby Toolbox® for infants and young children 1-42 months old. English and Spanish versions of gaze-based (Familiarization) and touch-based measures (Mullen Visual Reception, Visual Delayed Response, Delayed Memory Task) were adapted for iPad administration. A nationally representative sample of children (N = 2515 recruited; N = 2448 who completed at least one EF-Mem measure; n = 1993 English, n = 455 Spanish) were administered the Baby Toolbox EF-Mem measures as part of the full battery of Baby Toolbox measures. Most caregivers completed the Ages and Stages Questionnaire 3rd edition (ASQ-3), and separate subsets of children were administered the Bayley Scales for Infant Development 4th Edition (Bayley-4, n = 120 recruited; n = 117 that completed both EF-Mem and the Bayley-4) or were administered the Baby Toolbox battery again 1-14 days later to assess test-retest reliability (n = 220 recruited; n = 187 who completed any EF-Mem measure twice). Measure-level scores showed expected correlations with age for gaze and touch measures. EF-Mem scores showed expected correlations with the Bayley-4 for both gaze and touch, indicating convergent validity, and varied meaningfully based on the ASQ-3 classifications. Test-retest reliability for all measures and empirical reliability of the touch scores were moderate. Automatic coding and scoring, ease of administration, reliability, and validity of Baby Toolbox EF-Mem make these measures valuable for developmental assessment.

Keywords: Assessment; Executive function; Immediate learning; Memory; NIH Baby Toolbox.

PubMed Disclaimer

Similar articles

Cited by

  • Automated iPad-based gaze detection in the NIH Baby Toolbox® norming study.
    Novack MA, Han YC, Kaat AJ, Pila S, Flynn RM, Bedjeti K, Diaz MV, Hanrahan RT, Glinberg S, Sievert PH, Frederick C, Rajiv P, Clare C, Ustsinovich V, Gershon RC. Novack MA, et al. Infant Behav Dev. 2025 Sep;80:102119. doi: 10.1016/j.infbeh.2025.102119. Epub 2025 Aug 1. Infant Behav Dev. 2025. PMID: 40752054
  • Development and validation of the NIH Baby Toolbox® Math measures.
    Pila S, Han YC, Adam H, Ustsinovich V, Kaat AJ, Sarama J, Clements DH, Gershon RC. Pila S, et al. Infant Behav Dev. 2025 Sep;80:102116. doi: 10.1016/j.infbeh.2025.102116. Epub 2025 Jul 29. Infant Behav Dev. 2025. PMID: 40737983
  • NIH Baby Toolbox® methodology and norms development.
    Han YC, Dworak EM, Mansolf M, Adam H, Yao L, Novack MA, Pila S, Flynn RM, Flagg AM, Ustsinovich V, Savio K, Byrne GJ, Gershon RC, Kaat AJ. Han YC, et al. Infant Behav Dev. 2025 Sep;80:102117. doi: 10.1016/j.infbeh.2025.102117. Epub 2025 Jul 30. Infant Behav Dev. 2025. PMID: 40743801

Publication types