Hot-spots detection in count data by Poisson assisted smooth sparse tensor decomposition
- PMID: 37808612
- PMCID: PMC10557627
- DOI: 10.1080/02664763.2022.2112557
Hot-spots detection in count data by Poisson assisted smooth sparse tensor decomposition
Abstract
Count data occur widely in many bio-surveillance and healthcare applications, e.g. the numbers of new patients of different types of infectious diseases from different cities/counties/states repeatedly over time, say, daily/weekly/monthly. For this type of count data, one important task is the quick detection and localization of hot-spots in terms of unusual infectious rates so that we can respond appropriately. In this paper, we develop a method called Poisson assisted Smooth Sparse Tensor Decomposition (PoSSTenD), which not only detect when hot-spots occur but also localize where hot-spots occur. The main idea of our proposed PoSSTenD method is articulated as follows. First, we represent the observed count data as a three-dimensional tensor including (1) a spatial dimension for location patterns, e.g. different cities/countries/states; (2) a temporal domain for time patterns, e.g. daily/weekly/monthly; (3) a categorical dimension for different types of data sources, e.g. different types of diseases. Second, we fit this tensor into a Poisson regression model, and then we further decompose the infectious rate into two components: smooth global trend and local hot-spots. Third, we detect when hot-spots occur by building a cumulative sum (CUSUM) control chart and localize where hot-spots occur by their LASSO-type sparse estimation. The usefulness of our proposed methodology is validated through numerical simulation studies and a real-world dataset, which records the annual number of 10 different infectious diseases from 1993 to 2018 for 49 mainland states in the United States.
Keywords: CUSUM; Hot-spots detection; Poisson regression; spatio-temporal model; tensor decomposition.
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
Conflict of interest statement
No potential conflict of interest was reported by the author(s).
Figures







Similar articles
-
Rapid detection of hot-spots via tensor decomposition with applications to crime rate data.J Appl Stat. 2021 Jan 27;49(7):1636-1662. doi: 10.1080/02664763.2021.1874892. eCollection 2022. J Appl Stat. 2021. PMID: 35707553 Free PMC article.
-
[Visualization analysis on treatment of coronavirus based on knowledge graph].Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2020 Mar;32(3):279-286. doi: 10.3760/cma.j.cn121430-20200225-00200. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2020. PMID: 32385990 Chinese.
-
FiSH: fair spatial hot spots.Data Min Knowl Discov. 2022 Nov 17:1-30. doi: 10.1007/s10618-022-00887-4. Online ahead of print. Data Min Knowl Discov. 2022. PMID: 36415752 Free PMC article.
-
Methodology and software to detect viral integration site hot-spots.BMC Bioinformatics. 2011 Sep 14;12:367. doi: 10.1186/1471-2105-12-367. BMC Bioinformatics. 2011. PMID: 21914224 Free PMC article.
-
Potential air toxics hot spots in truck terminals and cabs.Res Rep Health Eff Inst. 2012 Dec;(172):5-82. Res Rep Health Eff Inst. 2012. PMID: 23409510 Free PMC article.
Cited by
-
Editorial to the special issue: modern streaming data analytics.J Appl Stat. 2023 Oct 5;50(14):2857-2861. doi: 10.1080/02664763.2023.2247646. eCollection 2023. J Appl Stat. 2023. PMID: 37808613 Free PMC article. No abstract available.
References
-
- Beck A. and Teboulle M., A fast iterative shrinkage-thresholding algorithm for linear inverse problems, SIAM. J. Imaging. Sci. 2 (2009), pp. 183–202.
-
- Brègman L.M., Relaxation method for finding a common point of convex sets and its application to optimization problems, Doklady Akademii Nauk. 171 (1966), pp. 1019–1022. Russian Academy of Sciences.
-
- Chen J. and Fang F., Semiparametric likelihood for estimating equations with non-ignorable non-response by non-response instrument, J. Nonparametr. Stat. 31 (2019), pp. 420–434.
-
- Chen J., Fang F., and Xiao Z., Semiparametric inference for estimating equations with nonignorably missing covariates, J. Nonparametr. Stat. 30 (2018a), pp. 796–812.
-
- Chen J., Shao J., and Fang F., Instrument search in pseudo-likelihood approach for nonignorable nonresponse, Ann. Inst. Stat. Math. 73 (2021a), pp. 519–533.
Grants and funding
LinkOut - more resources
Full Text Sources