Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 28;17(19):7119.
doi: 10.3390/ijerph17197119.

Groundwater Arsenic Distribution in India by Machine Learning Geospatial Modeling

Affiliations

Groundwater Arsenic Distribution in India by Machine Learning Geospatial Modeling

Joel Podgorski et al. Int J Environ Res Public Health. .

Abstract

Groundwater is a critical resource in India for the supply of drinking water and for irrigation. Its usage is limited not only by its quantity but also by its quality. Among the most important contaminants of groundwater in India is arsenic, which naturally accumulates in some aquifers. In this study we create a random forest model with over 145,000 arsenic concentration measurements and over two dozen predictor variables of surface environmental parameters to produce hazard and exposure maps of the areas and populations potentially exposed to high arsenic concentrations (>10 µg/L) in groundwater. Statistical relationships found between the predictor variables and arsenic measurements are broadly consistent with major geochemical processes known to mobilize arsenic in aquifers. In addition to known high arsenic areas, such as along the Ganges and Brahmaputra rivers, we have identified several other areas around the country that have hitherto not been identified as potential arsenic hotspots. Based on recent reported rates of household groundwater use for rural and urban areas, we estimate that between about 18-30 million people in India are currently at risk of high exposure to arsenic through their drinking water supply. The hazard models here can be used to inform prioritization of groundwater quality testing and environmental public health tracking programs.

Keywords: India; arsenic; geospatial modeling; groundwater; machine learning; random forest.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
Groundwater arsenic data points and simplified geology of the Indian subcontinent. (a) Spatially averaged arsenic data points used in modeling along with topography in India and neighboring countries. (b) Lithology of the Indian subcontinent.
Figure 2
Figure 2
Arsenic hazard maps. (a) Probability of arsenic concentration in groundwater exceeding 10 µg/L. (b) High hazard areas in India based on probability cutoffs of 0.49 and 0.55.
Figure 3
Figure 3
Normalized variable importance in terms of mean decrease in accuracy and mean decrease in Gini as calculated on the test dataset. Both decrease in accuracy and decrease in Gini were normalized by their respective greatest values (see Table S1).
Figure 4
Figure 4
Correlations of predictor variables (ax) with percentages of arsenic data points exceeding 10 µg/L in 16 equally sized bins. Kendall correlations (τB) with a statistically significant p value (95% confidence level) are shown in bold.
Figure 5
Figure 5
Analyses of model performance using full modeling dataset. (a) Sensitivity and specificity were found to be equivalent at a probability cutoff of 0.49 with a corresponding accuracy of 96%. (b) Positive predictive value (PPV) and negative predictive value (NPV) were found to be equivalent at a probability cutoff of 0.55 also with a corresponding accuracy of 96%.

Similar articles

Cited by

References

    1. Rodell M., Velicogna I., Famiglietti J.S. Satellite-based estimates of groundwater depletion in India. Nature. 2009;460:999. doi: 10.1038/nature08238. - DOI - PubMed
    1. Famiglietti J.S. The global groundwater crisis. Nat. Clim. Chang. 2014;4:945–948. doi: 10.1038/nclimate2425. - DOI
    1. Polya D.A., Middleton D.R. Arsenic in drinking water: Sources & human exposure. In: Bhattacharya P., Polya D.A., Draganovic D., editors. Best Practice Guide on the Control of Arsenic in Drinking Water. 1st ed. International Water Association Publishing; London, UK: 2017. Chapter 1.
    1. Smedley P., Kinniburgh D. A review of the source, behaviour and distribution of arsenic in natural waters. Appl. Geochem. 2002;17:517–568. doi: 10.1016/S0883-2927(02)00018-5. - DOI
    1. Charlet L., Polya D.A. Arsenic in shallow, reducing groundwaters in southern Asia: An environmental health disaster. Elements. 2006;2:91–96. doi: 10.2113/gselements.2.2.91. - DOI

Publication types