Psychiatr Serv. 2008 Apr;59(4):361-8.

doi: 10.1176/ps.2008.59.4.361.

Using computerized adaptive testing to reduce the burden of mental health assessment

Robert D Gibbons¹, David J Weiss, David J Kupfer, Ellen Frank, Andrea Fagiolini, Victoria J Grochocinski, Dulal K Bhaumik, Angela Stover, R Darrell Bock, Jason C Immekus

PMID: 18378832
PMCID: PMC2916927
DOI: 10.1176/ps.2008.59.4.361

Robert D Gibbons et al. Psychiatr Serv. 2008 Apr.

Affiliation

¹ Center for Health Statistics, University of Illinois at Chicago, Psychiatric Institute 457, M/C 912, Chicago, IL 60680-6998, USA. rdgib@uic.edu

Abstract

Objective: This study investigated the combination of item response theory and computerized adaptive testing (CAT) for psychiatric measurement as a means of reducing the burden of research and clinical assessments.

Methods: Data were from 800 participants in outpatient treatment for a mood or anxiety disorder; they completed 616 items of the 626-item Mood and Anxiety Spectrum Scales (MASS) at two times. The first administration was used to design and evaluate a CAT version of the MASS by using post hoc simulation. The second confirmed the functioning of CAT in live testing.

Results: Tests of competing models based on item response theory supported the scale's bifactor structure, consisting of a primary dimension and four group factors (mood, panic-agoraphobia, obsessive-compulsive, and social phobia). Both simulated and live CAT showed a 95% average reduction (585 items) in items administered (24 and 30 items, respectively) compared with administration of the full MASS. The correlation between scores on the full MASS and the CAT version was .93. For the mood disorder subscale, differences in scores between two groups of depressed patients--one with bipolar disorder and one without--on the full scale and on the CAT showed effect sizes of .63 (p<.003) and 1.19 (p<.001) standard deviation units, respectively, indicating better discriminant validity for CAT.

Conclusions: Instead of using small fixed-length tests, clinicians can create item banks with a large item pool, and a small set of the items most relevant for a given individual can be administered with no loss of information, yielding a dramatic reduction in administration time and patient and clinician burden.
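
The adaptive procedure summarized above can be illustrated with a minimal post hoc simulation. This is a sketch under stated assumptions, not the authors' implementation: it uses a unidimensional two-parameter logistic model with binary items rather than the bifactor model reported in the Results, the item parameters are synthetic, and the names `prob_endorse`, `eap_estimate`, and `run_cat` are hypothetical. The bank size of 616 and the stopping rule of a standard error of .30 are illustrative choices that echo figures quoted in this record.

```python
import numpy as np

def prob_endorse(theta, a, b):
    """2PL probability of endorsing an item at impairment level theta."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def eap_estimate(responses, a, b, grid):
    """Posterior mean and SD of theta on a grid, standard normal prior."""
    log_post = -0.5 * grid**2
    for u, a_i, b_i in zip(responses, a, b):
        p = prob_endorse(grid, a_i, b_i)
        log_post += np.log(p if u else 1.0 - p)
    post = np.exp(log_post - log_post.max())
    post /= post.sum()
    mean = float(np.sum(grid * post))
    sd = float(np.sqrt(np.sum((grid - mean) ** 2 * post)))
    return mean, sd

def run_cat(answer_item, a, b, se_stop=0.30):
    """Administer the most informative unused item until SE <= se_stop."""
    grid = np.linspace(-4.0, 4.0, 161)      # score range assumed to be -4..4
    used = np.zeros(len(a), dtype=bool)
    order, responses = [], []
    theta, se = 0.0, 1.0                    # start at the prior mean
    while se > se_stop and not used.all():
        p = prob_endorse(theta, a, b)
        info = a**2 * p * (1.0 - p)         # Fisher information at theta
        info[used] = -np.inf                # never repeat an item
        item = int(np.argmax(info))
        used[item] = True
        order.append(item)
        responses.append(answer_item(item))
        theta, se = eap_estimate(responses, a[order], b[order], grid)
    return theta, se, order

# Synthetic item bank and one simulated respondent (all values assumed).
rng = np.random.default_rng(0)
n_items = 616
a = rng.uniform(0.8, 2.5, n_items)          # discrimination parameters
b = rng.uniform(-3.0, 3.0, n_items)         # severity (location) parameters
true_theta = 1.26
full_responses = rng.random(n_items) < prob_endorse(true_theta, a, b)

# Post hoc simulation: the CAT "asks" items by looking up stored responses.
theta_cat, se_cat, order = run_cat(lambda i: bool(full_responses[i]), a, b)
theta_full, se_full = eap_estimate(full_responses, a, b,
                                   np.linspace(-4.0, 4.0, 161))

print(f"CAT:  {len(order)} items, theta={theta_cat:.2f}, SE={se_cat:.2f}")
print(f"Full: {n_items} items, theta={theta_full:.2f}, SE={se_full:.2f}")
```

Because the simulated test reads answers from a complete response record, the same loop could in principle be replayed over every participant in a calibration sample to compare adaptive and full-scale scores, which is the general idea behind a post hoc simulation.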

Conflict of interest statement

The authors report no competing interests.

Figures

**Figure 1**
Item-by-item administration to a study participant of the Mood and Anxiety Spectrum Scales by use of computerized adaptive testing. Possible impairment scores range from −4 to 4, with higher scores indicating greater impairment. The vertical bars indicate a standard error band of 1. Final impairment estimate: M±SE=1.26±.30.

**Figure 2**
Frequency distribution of impairment estimates for the entire calibration sample (N=800). Possible impairment scores range from −4 to 4, with higher scores indicating greater impairment.

Comment in

Are we ready for computerized adaptive testing?
Unick GJ, Shumway M, Hargreaves W. Psychiatr Serv. 2008 Apr;59(4):369. doi: 10.1176/ps.2008.59.4.369. PMID: 18378833.

Cited by

Development of a computerized adaptive test for depression.
Gibbons RD, Weiss DJ, Pilkonis PA, Frank E, Moore T, Kim JB, Kupfer DJ. Arch Gen Psychiatry. 2012 Nov;69(11):1104-12. doi: 10.1001/archgenpsychiatry.2012.14. PMID: 23117634.

Differences in Patient Health Questionnaire and Aachen Depression Item Bank scores between tablet versus paper-and-pencil administration.
Spangenberg L, Glaesmer H, Boecker M, Forkmann T. Qual Life Res. 2015 Dec;24(12):3023-32. doi: 10.1007/s11136-015-1040-5. Epub 2015 Jun 13. PMID: 26071119.

Structure and measurement of depression in youths: applying item response theory to clinical data.
Cole DA, Cai L, Martin NC, Findling RL, Youngstrom EA, Garber J, Curry JF, Hyde JS, Essex MJ, Compas BE, Goodyer IM, Rohde P, Stark KD, Slattery MJ, Forehand R. Psychol Assess. 2011 Dec;23(4):819-33. doi: 10.1037/a0023518. Epub 2011 May 2. PMID: 21534696.

Psychometric Evaluation of the Improved Work-Disability Functional Assessment Battery.
Meterko M, Marino M, Ni P, Marfeo E, McDonough CM, Jette A, Peterik K, Rasch E, Brandt DE, Chan L. Arch Phys Med Rehabil. 2019 Aug;100(8):1442-1449. doi: 10.1016/j.apmr.2018.09.125. Epub 2018 Dec 19. PMID: 30578775.

Development of the Preschool Life Impact Burn Recovery Evaluation (PS-LIBRE1-5) Profile.
Patel KF, Ni P, Surette KE, Rencken CA, Rodríguez-Mercedes SL, McGwin MB, Fabia R, Tully C, Warner P, Romanowski KS, Palmieri T, Stoddard FJ Jr, Schneider JC, Kazis LE, Ryan CM. J Burn Care Res. 2024 Jan 5;45(1):136-144. doi: 10.1093/jbcr/irad136. PMID: 37703100.
