Nonrepresentativeness of Human Mobility Data and its Impact on Modeling Dynamics of the COVID-19 Pandemic: Systematic Evaluation
- PMID: 38941609
- PMCID: PMC11245661
- DOI: 10.2196/55013
Abstract
Background: In recent years, a range of novel smartphone-derived data streams about human mobility have become available on a near-real-time basis. These data have been used, for example, to perform traffic forecasting and epidemic modeling. During the COVID-19 pandemic in particular, human travel behavior has been considered a key component of epidemiological modeling to provide more reliable estimates about the volumes of the pandemic's importation and transmission routes, or to identify hot spots. However, nearly universally in the literature, the representativeness of these data (that is, how well they reflect the underlying real-world human mobility) has been overlooked. This disconnect between data and reality is especially relevant in the case of socially disadvantaged minorities.
Objective: The objective of this study is to illustrate the nonrepresentativeness of data on human mobility and the impact of this nonrepresentativeness on modeling dynamics of the epidemic. This study systematically evaluates how real-world travel flows differ from census-based estimations, especially in the case of socially disadvantaged minorities, such as older adults and women, and further measures biases introduced by this difference in epidemiological studies.
Methods: To understand the demographic composition of population movements, a nationwide mobility data set from 318 million mobile phone users in China from January 1 to February 29, 2020, was curated. Specifically, we quantified the disparity in population composition between actual migrations and the resident composition according to census data, and showed how this nonrepresentativeness impacts epidemiological modeling by constructing an age-structured SEIR (Susceptible-Exposed-Infected-Recovered) model of COVID-19 transmission.
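As a rough illustration of what an age-structured SEIR model involves, the following Python sketch integrates the standard compartmental equations for three age groups with a forward-Euler scheme. It is not the authors' implementation: the function name `simulate_age_seir`, the population sizes, the contact matrix, and the rate parameters are all hypothetical values chosen for demonstration. The only figures taken from the abstract are the young and middle-aged shares; the share of the remaining group is an assumption obtained by treating the three groups as exhaustive.

```python
def simulate_age_seir(pop, contact, beta, sigma, gamma, seeds, days, dt=0.1):
    """Forward-Euler integration of an age-structured SEIR model.

    pop[i]        -- size of age group i
    contact[i][j] -- mean daily contacts of a person in group i with group j
    beta          -- transmission probability per contact
    sigma, gamma  -- rates of leaving the exposed / infected compartments
    seeds[i]      -- initially infected individuals in group i
    Returns (daily total infected, final (S, E, I, R) lists).
    """
    n = len(pop)
    S = [float(pop[i]) - seeds[i] for i in range(n)]
    E = [0.0] * n
    I = [float(s) for s in seeds]
    R = [0.0] * n
    daily_infected = []
    steps_per_day = round(1 / dt)
    for day in range(days):
        for step in range(steps_per_day):
            # Force of infection on each group, from the state before this step.
            lam = [beta * sum(contact[i][j] * I[j] / pop[j] for j in range(n))
                   for i in range(n)]
            for i in range(n):
                s_to_e = lam[i] * S[i] * dt
                e_to_i = sigma * E[i] * dt
                i_to_r = gamma * I[i] * dt
                S[i] -= s_to_e
                E[i] += s_to_e - e_to_i
                I[i] += e_to_i - i_to_r
                R[i] += i_to_r
        daily_infected.append(sum(I))
    return daily_infected, (S, E, I, R)


# Hypothetical setup: groups are (young, middle-aged, remaining adults).
POP = [360_000.0, 400_000.0, 240_000.0]
CONTACT = [[8.0, 4.0, 1.5],
           [4.0, 6.0, 2.0],
           [1.5, 2.0, 3.0]]          # illustrative values, not empirical
PARAMS = dict(beta=0.03, sigma=1 / 5, gamma=1 / 7)  # illustrative rates


def seed_by(shares, total=100.0):
    """Distribute `total` imported infections across groups by share."""
    return [total * s for s in shares]


# Seeding by census composition vs. by observed traveler composition.
# The third share in each list is the assumed residual (1 - young - middle).
census_seed = seed_by([0.36, 0.40, 0.24])
flow_seed = seed_by([0.59, 0.36, 0.05])

for label, seeds in [("census", census_seed), ("flows", flow_seed)]:
    daily, _ = simulate_age_seir(POP, CONTACT, seeds=seeds, days=300, **PARAMS)
    peak = max(daily)
    print(label, "peak infected:", round(peak), "on day", daily.index(peak) + 1)
```

Comparing the two runs shows how the assumed age mix of imported cases can shift a simulated epidemic curve. The size of the shift in this toy setup depends entirely on the hypothetical parameters and should not be read as a reproduction of the study's results.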
Results: We found a significant difference in demographic composition between those who travel and the overall population. In the population flows, 59% (n=20,067,526) of travelers are young and 36% (n=12,210,565) are middle-aged (P<.001), which differs markedly from the overall adult population composition of China (where 36% of individuals are young and 40% are middle-aged). This difference would introduce a striking bias in epidemiological studies: the estimated maximum number of daily infections differs by a factor of nearly 3, and the estimated peak time differs by 46 days.
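The size of the compositional gap reported above can be summarized with simple arithmetic. The sketch below computes a per-group representation ratio and the total variation distance between the traveler and census compositions. These two metrics are chosen here for illustration and are not necessarily the measures used in the study. The "other" share is an assumption, taken as the residual after the young and middle-aged shares stated in the abstract.

```python
def with_residual(young, middle):
    """Build a three-group composition; 'other' is the assumed residual share."""
    return {"young": young, "middle-aged": middle, "other": 1.0 - young - middle}


travelers = with_residual(0.59, 0.36)  # shares among travelers (abstract)
census = with_residual(0.36, 0.40)     # shares in the adult population (abstract)

# Representation ratio: > 1 means over-represented in the mobility flows.
ratios = {group: travelers[group] / census[group] for group in travelers}

# Total variation distance: half the sum of absolute share differences.
tvd = 0.5 * sum(abs(travelers[g] - census[g]) for g in travelers)

for group, ratio in ratios.items():
    print(f"{group}: representation ratio {ratio:.2f}")
print(f"total variation distance: {tvd:.2f}")
```

Under these assumptions, young adults are over-represented among travelers while the residual group is strongly under-represented, which is the pattern the abstract describes.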
Conclusions: The difference between actual migrations and resident composition strongly impacts outcomes of epidemiological forecasts, which typically assume that flows represent underlying demographics. Our findings imply that it is necessary to measure and quantify the inherent biases related to nonrepresentativeness for accurate epidemiological surveillance and forecasting.
Keywords: COVID-19; data representativeness; epidemiological modeling; human mobility; population composition.
©Chuchu Liu, Petter Holme, Sune Lehmann, Wenchuan Yang, Xin Lu. Originally published in JMIR Formative Research (https://formative.jmir.org), 28.06.2024.
Conflict of interest statement
Conflicts of Interest: None declared.