Review

What is the validity evidence for assessments of clinical teaching?

Thomas J Beckman et al. J Gen Intern Med. 2005 Dec;20(12):1159-64. doi: 10.1111/j.1525-1497.2005.0258.x.

Abstract

Background: Although a variety of validity evidence should be used when evaluating assessment tools, a review of teaching assessments suggested that authors pursue only a limited range of validity evidence.

Objectives: To develop a method for rating validity evidence and to quantify the evidence supporting scores from existing clinical teaching assessment instruments.

Design: A comprehensive search yielded 22 articles on clinical teaching assessments. Using standards outlined by the American Psychological and Educational Research Associations, we developed a method for rating the 5 categories of validity evidence reported in each article. We then quantified the validity evidence by summing the ratings for each category. We also calculated weighted kappa coefficients to determine interrater reliabilities for each category of validity evidence.
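
To make the reliability analysis concrete, the following is a minimal Python sketch of the two quantitative steps described above: summing per-article ratings for one evidence category, and computing a weighted kappa between two raters. The rating values, the 0-2 ordinal scale (chosen because 22 articles at 2 points each would give the 44-point maximum reported below), and the linear weighting scheme are all assumptions for illustration; the abstract does not report the raw rating data or the exact weighting used.

```python
# Sketch of the rating-and-reliability analysis, under assumptions:
# a hypothetical 0-2 ordinal scale per article (22 articles x 2 = the
# reported 44-point maximum) and linearly weighted kappa. The rating
# values below are invented for illustration only.
from sklearn.metrics import cohen_kappa_score

# Hypothetical ratings by two independent raters for one
# validity-evidence category (e.g., Content) across 22 articles.
rater_1 = [2, 1, 0, 2, 1, 1, 2, 0, 1, 2, 1, 0, 2, 2, 1, 0, 1, 2, 1, 0, 2, 1]
rater_2 = [2, 1, 1, 2, 1, 0, 2, 0, 1, 2, 1, 0, 2, 1, 1, 0, 1, 2, 1, 1, 2, 1]

# Step 1: quantify the evidence for the category by summing ratings.
category_score = sum(rater_1)

# Step 2: weighted kappa penalizes disagreements in proportion to
# their distance on the ordinal scale ("linear" weights).
kappa = cohen_kappa_score(rater_1, rater_2, weights="linear")

print(f"Summed category rating: {category_score} of 44 possible")
print(f"Weighted kappa: {kappa:.2f}")
```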

Main results: Content and Internal Structure evidence received the highest ratings (27 and 32, respectively, of 44 possible). Relation to Other Variables, Consequences, and Response Process received the lowest ratings (9, 2, and 2, respectively). Interrater reliability was good for Content, Internal Structure, and Relation to Other Variables (kappa range 0.52 to 0.96, all P values < .01), but poor for Consequences and Response Process.

Conclusions: Content and Internal Structure evidence is well represented among published assessments of clinical teaching. Evidence for Relation to Other Variables, Consequences, and Response Process receives little attention, and future research should emphasize these categories. The low interrater reliability for Response Process and Consequences likely reflects the scarcity of reported evidence. With further development, our method for rating the validity evidence should prove useful in various settings.
