Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 4:10:101994.
doi: 10.1016/j.mex.2022.101994. eCollection 2023.

Determination of the best knot and bandwidth in geographically weighted truncated spline nonparametric regression using generalized cross validation

Affiliations

Determination of the best knot and bandwidth in geographically weighted truncated spline nonparametric regression using generalized cross validation

Robiansyah Putra et al. MethodsX. .

Abstract

This study proposes the development of nonparametric regression for data containing spatial heterogeneity with local parameter estimates for each observation location. GWTSNR combines Truncated Spline Nonparametric Regression (TSNR) and Geographically Weighted Regression (GWR). So it is necessary to determine the optimum knot point from TSNR and determine the best geographic weighting (bandwidth) from GWR by deciding the best knot point and bandwidth using Generalized Cross Validation (GCV). The case study analyzed the Morbidity Rate in North Sumatra in 2020. This study will estimate the model using knot points 1, 2, and 3 and geographic weighting of the Kernel Function, Gaussian, Bisquare, Tricube, and Exponential. Based on data analysis, we obtained that the best model for Morbidity Rate data in North Sumatra 2020 based on the minimum GCV value is the model using knots 1 and the Kernel Function of Bisquare. Based on the GWTSNR model, the significant predictors in each district/city were grouped into eight groups. Furthermore, the GWTSNR is better at modeling morbidity rates in North Sumatra 2020 by obtaining adjusted R-square = 96.235 than the TSNR by obtaining adjusted R-squared = 70.159. Some of the highlights of the proposed approach are:•The method combines nonparametric and spatial regression in determining morbidity rate modeling.•There were three-knot points tested in the truncated spline nonparametric regression and four geographic weightings in the spatial regression and then to determine the best knot and bandwidth using Generalized Cross Validation.•This paper will determine regional groupings in North Sumatra 2020 based on significant predictors in modeling morbidity rates.

Keywords: Geographically Weighted Truncated Spline Nonparametric Regression (GWTSNR); Kernel function; Morbidity rate; Nonparametric regression; Spatial regression.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image, graphical abstract
Graphical abstract
Fig 1
Fig. 1
Morbidity Rates in Indonesia and North Sumatra 2015 – 2020. It can be seen that for a period of six years, from 2015 to 2020, the morbidity rates in North Sumatra Province were consistently below the national figure. All variables ranging from the response variable to the seven predictor variables that are thought to affect the average, variance to the minimum, and maximum values are calculated.
Fig 2
Fig. 2
Description information based on variable data mapping. The lowest Y is in Humbang Hasundutan Regency, and the highest Y is in Batubara Regency. The lowest X1is in Deli Serdang Regency, and the highest X1 is in West Nias Regency. The lowest X2 is in South Nias Regency, and the highest X2 is in Binjai City. The lowest X3 is in Pakpak Bharat Regency, and the highest X3 is in Medan City. The lowest X4 is in Humbang Hasundutan Regency, and the highest X4 is in Gunungsitoli City. The lowest X5is in West Nias Regency, and the highest X5 is in Medan City. The lowest X6 is in Padang Sidimpuan Regency, and the highest X6 is Pematangsiantar City. And the lowest X7 is Nias, and the highest X7 is in Medan City.
Fig 3
Fig. 3
Scatterplot between Morbidity Rate and Predictors. The plot between the variable morbidity rate with all predictors does not form or follow a certain pattern. So that all predictors are included in nonparametric components.
Fig 4
Fig. 4
Mapping Morbidity Rates in North Sumatra 2020 based on Significant Variables.

Similar articles

Cited by

References

    1. Ayeni O. The importance of morbidity statistics in the evaluation of public health in Africa. Jimlar Mutane. 1976;1(2):193–197. - PubMed
    1. The health office in North Sumatra. North Sumatra provincial health office performance report, 2020 online at http://dinkes.sumutprov.go.id/.
    1. Cressie N.A.C. John Wiley and Sons; New York: 1991. Statistics for Spatial Data; pp. 803–872.
    1. LeSage J.P. Regression analysis of spatial data. J. Reg. Policy. 1997;27(2):83–94.
    1. Fotheringham A.S., Brunsdon C., Charlton M. John Wiley and Sons Ltd; England: 2002. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships; pp. 65–102.

LinkOut - more resources