Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 11:7:146.
doi: 10.3389/fpubh.2019.00146. eCollection 2019.

Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care

Affiliations

Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care

Odile Sauzet et al. Front Public Health. .

Abstract

Background: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the questionnaire and those who will always provide a completely positive evaluation irrespective of the item. The aim of the present study is to compare various statistical approaches to the analysis of such data using data from a rehabilitation patient survey of the German Statutory Pension Insurance Scheme as an example. Method: We used data from 272,806 respondents who participated in the survey from 2008 to 2011. We illustrate four models: linear regression, logistic regression, a two-part model based on the assumption of two underlying populations and quantile regression, which does not require any distributional assumptions. For each model we consider the relationship of the satisfaction score with two covariates. Results: While providing correct estimates of the mean values (marginal effects), the assumptions of the linear model are violated which can lead to false interpretations. A two-part regression which consists of a logistic regression followed by a linear regression conditional on not being fully satisfied is a useful alternative. For research questions focusing on specific parts of the distribution, logistic regression as well as quantile regression are to be considered. Discussion: Data with a spike represents a statistical challenge but a range of modeling approaches is available to provide sound interpretations and correct answers to research questions.

Keywords: data with spike; linear regresion; quantile regression; satisfaction survey; two-part regression models.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Histogram of satisfaction with medical care score (left) and predicted values (right) of a linear regression with dependent variables age and nationality (German/non-German).
Figure 2
Figure 2
Scatter plots of satisfaction with medical care scores by date of birth in four random sub-samples of 2,000 observations each.
Figure 3
Figure 3
Box plots of satisfaction with medical care per age categories.
Figure 4
Figure 4
Quantile regression coefficients against quantiles.

Similar articles

Cited by

References

    1. Bleich SN, Özaltin E, Murray CJ. How does satisfaction with the health-care system relate to patient experience? Bull World Health Organ. (2009) 87:271–8. 10.2471/BLT.07.050401 - DOI - PMC - PubMed
    1. Collins K, O'Cathain A. The continuum of patient satisfaction-from satisfied to very satisfied. Soc Sci Med (1982). (2003) 57:2465–70. 10.1016/S0277-9536(03)00098-4 - DOI - PubMed
    1. Sitzia J. How valid and reliable are patient satisfaction data? An analysis of 195 studies. Int J Qual Health Care. (1999) 11:319–28. 10.1093/intqhc/11.4.319 - DOI - PubMed
    1. Chang BH, Pocock S. Analyzing data with clumping at zero. An example demonstration. J Clin Epidemiol. (2000) 53:1036–43. 10.1016/S0895-4356(00)00223-7 - DOI - PubMed
    1. Breitling LP, Brenner H. Odd odds interactions introduced through dichotomisation of continuous outcomes. J Epidemiol Commun Health. (2010) 64:300–3. 10.1136/jech.2009.089458 - DOI - PubMed

LinkOut - more resources