Subjective rating scales: science or art?
- PMID: 12569049
- DOI: 10.1080/00140130210166951
Subjective rating scales: science or art?
Abstract
Subjective rating scales are widely used in almost every aspect of ergonomics research and practice for the assessment of workload, fatigue, usability, annoyance and comfort, and lesser known qualities such as urgency and presence, but are they truly scientific? This paper raises some of the key issues as a basis for debate. First, it is argued that all empirical observations, including those conventionally labelled as 'objective', are unavoidably subjective. Shared meaning between observers, or intersubjectivity, is the key criterion of scientific probity. The practical steps that can be taken to increase intersubjective agreement are discussed and the well-known sources of error and bias in human judgement reviewed. The role of conscious experience as a mechanism for appraising the environment and guiding behaviour has important implications for the interpretation of subjective reports. The view that psychometric measures do not conform to the requirements of truly 'scientific' measurement is discussed. Human judgement of subjective attributes is essentially ordinal and, unlike physical measures, can be matched to interval scales only with difficulty, but ordinal measures can be used successfully both to develop and test substantive theories using multivariate statistical techniques. Constructs such as fatigue are best understood as latent or inferred variables defined by a set of manifest or directly observed indicator variables. Both construct validity and predictive validity are viewed from this perspective and this helps to clarify several problems including the dissociation between measures of different aspects of a given construct, the question of whether physical (e.g. physiological) measures should be preferred to subjective measures and whether a single measure of constructs which are essentially multidimensional having both subjective and physical components is desirable. Finally, the fitness of subjective ratings to different purposes within the broad field of ergonomics research is discussed. For testing of competing hypotheses concerning the mechanisms underlying human performance, precise quantitative predictions are rarely needed. The same is frequently true of comparative evaluation of competing designs. In setting design standards, however, something approaching the level of measurement required for precise quantitative prediction is required, but this is difficult to achieve in practice. Although it may be possible to establish standards within restricted contexts, general standards for broadly conceived constructs such as workload are impractical owing to the requirement for representative sampling of tasks, work environments and personnel.
Comment in
-
Creative (dis)agreement in ergonomics.Ergonomics. 2002 Nov 15;45(14):963-5. doi: 10.1080/00140130210166960. Ergonomics. 2002. PMID: 12569048 No abstract available.
-
Measurement and the practising ergonomist.Ergonomics. 2002 Nov 15;45(14):988-90; discussion 1042-6. doi: 10.1080/00140130210166915. Ergonomics. 2002. PMID: 12569050 No abstract available.
-
Sans subjectivity - ergonomics is engineering.Ergonomics. 2002 Nov 15;45(14):991-4; discussion 1042-6. doi: 10.1080/00140130210166898. Ergonomics. 2002. PMID: 12569051 No abstract available.
-
Is ergonomics empirical?Ergonomics. 2002 Nov 15;45(14):995-7; discussion 1042-6. doi: 10.1080/00140130210166889. Ergonomics. 2002. PMID: 12569052 No abstract available.
-
Subjective measures: not perfect but what is?Ergonomics. 2002 Nov 15;45(14):998-1000; discussion 1042-6. doi: 10.1080/00140130210166870. Ergonomics. 2002. PMID: 12569053 No abstract available.
-
Transactional perspectives on subjective rating scales.Ergonomics. 2002 Nov 15;45(14):1001-4; discussion 1042-6. doi: 10.1080/00140130210166861. Ergonomics. 2002. PMID: 12569054 No abstract available.
-
Use of subjective rating scores in ergonomics research and practice.Ergonomics. 2002 Nov 15;45(14):1005-7; discussion 1042-6. doi: 10.1080/00140130210166852. Ergonomics. 2002. PMID: 12569055 No abstract available.
-
Do ratings measure latent attributes?Ergonomics. 2002 Nov 15;45(14):1008-10; discussion 1042-6. doi: 10.1080/00140130210166843. Ergonomics. 2002. PMID: 12569056 No abstract available.
-
Subjective rating scales: scientific measures of perceived urgency?Ergonomics. 2002 Nov 15;45(14):1011-4; discussion 1042-6. doi: 10.1080/00140130210166834. Ergonomics. 2002. PMID: 12569057 No abstract available.
-
Measurements serve a purpose: a note on a possible use of natural measurements.Ergonomics. 2002 Nov 15;45(14):1015-7; discussion 1042-6. doi: 10.1080/00140130210166825. Ergonomics. 2002. PMID: 12569058 No abstract available.
-
It's all relative: defining mental workload in the light of Annett's paper.Ergonomics. 2002 Nov 15;45(14):1018-20; discussion 1042-6. doi: 10.1080/00140130210166816. Ergonomics. 2002. PMID: 12569059 No abstract available.
-
Subjective evaluation of usability.Ergonomics. 2002 Nov 15;45(14):1021-5; discussion 1042-6. doi: 10.1080/00140130210166807. Ergonomics. 2002. PMID: 12569060 No abstract available.
-
On the assessment of (mental) workload and other subjective qualifications.Ergonomics. 2002 Nov 15;45(14):1026-30; discussion 1042-6. doi: 10.1080/00140130210166799. Ergonomics. 2002. PMID: 12569061 No abstract available.
-
Measurement in virtual environments: another dimension to the objectivity/subjectivity debate.Ergonomics. 2002 Nov 15;45(14):1031-6; discussion 1042-6. doi: 10.1080/00140130210166780. Ergonomics. 2002. PMID: 12569062 No abstract available.
-
Can design supportive research be scientific?Ergonomics. 2002 Nov 15;45(14):1037-41; discussion 1042-6. doi: 10.1080/00140130210166771. Ergonomics. 2002. PMID: 12569063 No abstract available.
MeSH terms
LinkOut - more resources
Full Text Sources
Medical