Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 14:8:101491.
doi: 10.1016/j.mex.2021.101491. eCollection 2021.

Energy transition pathways amongst low-income urban households: A mixed method clustering approach

Affiliations

Energy transition pathways amongst low-income urban households: A mixed method clustering approach

André P Neto-Bradley et al. MethodsX. .

Abstract

Studies on clean energy transition amongst low-income urban households in the Global South use an array of qualitative and quantitative methods. However, attempts to combine qualitative and quantitative methods are rare and there are a lack of systematic approaches to this. This paper demonstrates a two stage approach using clustering methods to analyse a mixed dataset containing quantitative household survey data and qualitative interview data. By clustering the quantitative and qualitative data separately, latent groups with common characteristics and narratives arising from each of the two analyses are identified. A second stage of clustering identifies links between these qualitative and quantitative clusters and enables inference of energy transition pathways followed by low-income urban households defined by both quantitative characteristics and qualitative narratives. This approach can support interdisciplinary collaboration in energy research, providing a systematic approach to comparing and identifying links between quantitative and qualitative findings.•A mixed dataset comprising of quantitative survey data and qualitative interview data on low-income household energy use is analysed using hierarchical clustering to detect communities within each dataset.•Interviewees are matched to quantitative survey clusters and a second stage of clustering is performed using cluster membership as variables.•Second stage clusters identify common pairs of survey and interview clusters which define energy transition pathways based on socio-economic characteristics, energy use patterns, and narratives for decision making and practices.

Keywords: Clustering; Data science; Energy access; Mixed methods.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image, graphical abstract
Graphical abstract
Fig1
Fig. 1
Schematic overview of mixed method cluster analysis. The analysis starts with a first stage clustering of both the quantitative survey data and the qualitative interview transcripts. Hierarchical clustering methods are used in this first stage, with the interviews coded for the qualitative clustering. The interview respondents are then matched to a corresponding survey cluster based on common socio-economic variables and a second stage of clustering is performed to identify distinct combinations of qualitative and quantitative clusters that characterise different energy transition pathways.
Fig2
Fig. 2
Schematic of mixed data structure. Two distinct datasets are required: one contains qualitative household level survey data with a mix of energy and socio-economic variables, the other dataset consists of a table of codings applied to the transcripts of interviewed households.
Fig3
Fig. 3
Example of use of Silhouette Width and Slope Statistic to determine optimal number of clusters for a sample of households in Bangalore. Note how while there are maxima on the silhouette width plot at 5 and 16 the slope statistic indicates 5 as preferable.
Fig4
Fig. 4
Schematic of interview coding clustering process. Coded interview table is transformed into an adjacency matric by calculating correlation between respondents. This adjacency matrix is then clustered to assign interviewees to respective clusters.
Fig5
Fig. 5
Network graph of communities detected amongst interviewees based on interview coding correlation. These groups form the qualitative interview clusters.
Fig6
Fig. 6
Schematic of interviewee survey cluster matching based on socio-economic variables.
Fig7
Fig. 7
Schematic of second stage clustering and pathway characterisation. K-means clustering of the interview households using their qualitative and quantitative survey clusters as variables identifies pathway groups with unique pairs of.
Fig8
Fig. 8
Graphic showing pathway cluster membership, associated socio-economic characteristics, and key qualitative interview data codes for each pathway in the example of low-income households in Bangalore.

References

    1. Anderson D.R., David R. Statistics for Business and Economics. 5th ed. Thomson South-Western; Mason, OH: 2009. Essentials of statistics for business and economics. c2009., Mason, OH.
    1. Bansal N., Blum A., Chawla S. Correlation clustering. Mach. Learn. 2004;56:89–113. doi: 10.1023/B:MACH.0000033116.57574.95. - DOI
    1. Bickart B., Felcher E.M. Answering Questions. Jossey-Bass; San Francisco: 1996. Expanding and enhancing the use of verbal protocols in survey research.
    1. Bischoping K. New Techniques for Pretesting Survey Questions. Survey Research Center; Ann Arbor, MI: 1989. An evaluation of interviewer debriefing in survey pretests.
    1. Caliński T., Harabasz J. A dendrite method for cluster analysis. Commun. Stat. 1974;3:1–27. doi: 10.1080/03610927408827101. - DOI

LinkOut - more resources