Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 1;24(1):81.
doi: 10.1186/s12874-024-02204-7.

Overcoming denominator problems in refugee settings with fragmented electronic records for health and immigration data: a prediction-based approach

Affiliations

Overcoming denominator problems in refugee settings with fragmented electronic records for health and immigration data: a prediction-based approach

Stella Erdmann et al. BMC Med Res Methodol. .

Abstract

Background: Epidemiological studies in refugee settings are often challenged by the denominator problem, i.e. lack of population at risk data. We develop an empirical approach to address this problem by assessing relationships between occupancy data in refugee centres, number of refugee patients in walk-in clinics, and diseases of the digestive system.

Methods: Individual-level patient data from a primary care surveillance system (PriCarenet) was matched with occupancy data retrieved from immigration authorities. The three relationships were analysed using regression models, considering age, sex, and type of centre. Then predictions for the respective data category not available in each of the relationships were made. Twenty-one German on-site health care facilities in state-level registration and reception centres participated in the study, covering the time period from November 2017 to July 2021.

Results: 445 observations ("centre-months") for patient data from electronic health records (EHR, 230 mean walk-in clinics visiting refugee patients per month and centre; standard deviation sd: 202) of a total of 47.617 refugee patients were available, 215 for occupancy data (OCC, mean occupancy of 348 residents, sd: 287), 147 for both (matched), leaving 270 observations without occupancy (EHR-unmatched) and 40 without patient data (OCC-unmatched). The incidence of diseases of the digestive system, using patients as denominators in the different sub-data sets were 9.2% (sd: 5.9) in EHR, 8.8% (sd: 5.1) when matched, 9.6% (sd: 6.4) in EHR- and 12% (sd 2.9) in OCC-unmatched. Using the available or predicted occupancy as denominator yielded average incidence estimates (per centre and month) of 4.7% (sd: 3.2) in matched data, 4.8% (sd: 3.3) in EHR- and 7.4% (sd: 2.7) in OCC-unmatched.

Conclusions: By modelling the ratio between patient and occupancy numbers in refugee centres depending on sex and age, as well as on the total number of patients or occupancy, the denominator problem in health monitoring systems could be mitigated. The approach helped to estimate the missing component of the denominator, and to compare disease frequency across time and refugee centres more accurately using an empirically grounded prediction of disease frequency based on demographic and centre typology. This avoided over-estimation of disease frequency as opposed to the use of patients as denominators.

Keywords: Asylum seekers; Disease prevalence; Generalized linear models; Patient-registries; Prediction; Refugees.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Observations and predictions on relationship 2: patient-occupancy ratio when occupancy data is given. Observed (indicated by circles) and predicted (indicated by diamonds) ratio formula image and number of patients formula image on the basis of the matched data set and the model of Sect. 2.3.2 for the unmatched occupancy dataset, respectively. Observations/predictions of different facilities are indicated by the colours of the small dots inside the circles/diamonds, which are coloured with respect to the type of centre (REC: black, REG: blue). Analogues figures also indicating 95% confidence intervals for the observed estimates/ predictions can be found in Appendix A.4 (compare Figure A1)
Fig. 2
Fig. 2
Observations and predictions on relationship 3: patient-occupancy ratio when patient data is given. Observed (indicated by circles) and predicted (indicated by diamonds) ratio formula image and occupancy number formula image on the basis of the matched data set and the model of Sect. 2.3.3 for the unmatched EHR dataset, respectively. Observations/predictions of different centres are indicated by the colours of the small dots inside the circles/diamonds, which are coloured with respect to the type of centre (REC: black, REG: blue). Analogous figures with 95% confidence intervals for the observed estimates/predictions can be found in Appendix A.4 (compare Figure A2)
Fig. 3
Fig. 3
Observations and predictions on relationship 1 and 2: disease incidence depending on total population at risk (occupancy). Observed (indicated by circles) and predicted (indicated by diamonds) disease incidence with respect to occupancy number (i.e., number of incident cases of diseases of the digestive system divided by the occupancy number) on the basis of the EHR data set and the models of Sect. 2.3.2 (relationship 2: patient-occupancy ratio when occupancy data is given) and Sect. 2.3.1 (relationship 1: disease incidence) for the unmatched occupancy dataset, respectively. Observations/predictions of different facilities are indicated by the colours of the small dots inside the circles/diamonds, which are coloured with respect to the type of centre (REC: black, REG: blue). An analogous figure with 95% confidence intervals for the observed estimates/ predictions can be found in Appendix A.4 (compare Figure A3)

Similar articles

References

    1. Brenner H. 4 The Denominator Problem-A Literature Review. Comparison and Harmonisation of Denominator Data for Primary Health Care Research in Countries of the European Community: The European Denominator Project. 351999. p. 13.
    1. Schlaud M, Brenner MH, Hoopmann M, Schwartz F. Approaches to the denominator in practice-based epidemiology: a critical overview. J Epidemiol Commun Health (1979-). 1998:S13–9. - PubMed
    1. Mayo F, Marsland D, Wood M, Mosteller M, Miller GW, Johnson RE, et al. Denominator Definition by the utilization correction factor method. Fam Pract. 1986;3(3):184–91. doi: 10.1093/fampra/3.3.184. - DOI - PubMed
    1. Cherkin DC, Berg AO, Phillips WR. In search of a solution to the primary care denominator problem. J Fam Pract. 1982;14(2):301–9. - PubMed
    1. Morrison CN, Rundle AG, Branas CC, Chihuri S, Mehranbod C, Li G. The unknown denominator problem in population studies of disease frequency. Spat Spatio-temporal Epidemiol. 2020;35:100361. doi: 10.1016/j.sste.2020.100361. - DOI - PMC - PubMed

LinkOut - more resources