TY - JOUR
T1 - Predicting Regional-Scale Elevated Groundwater Nitrate Contamination Risk Using Machine Learning on Natural and Human-Induced Factors
AU - Sarkar, Soumyajit
AU - Mukherjee, Abhijit
AU - Gupta, Srimanti Dutta
AU - Bhanja, Soumendra Nath
AU - Bhattacharya, Animesh
N1 - Publisher Copyright:
© 2022 American Chemical Society.
PY - 2022/4/8
Y1 - 2022/4/8
N2 - Elevated groundwater nitrate poses risk to the ecosystem and human health, and delineating the extent of elevated groundwater nitrate risk is essential for effective groundwater management and public health safety. Here, using machine learning models (Random Forest, Boosted Regression Tree, and Logistic Regression) on a large, in situ dataset, we have predicted the first nationwide extent of groundwater nitrate contamination risk (concentration >45 mg/L) across India. We also aimed to delineate the intrinsic (e.g., climate, geomorphic, hydrogeologic) and extraneous (e.g., anthropogenic input) predictors for identifying groundwater pollution risk. Of these models, Random Forest performed best and was considered to develop the final prediction map of groundwater nitrate at 1 km2 resolution. Climate variables like precipitation and aridity, and anthropogenic influence, e.g., fertilizer application and population density, were identified as the most important variables for predicting groundwater nitrate risk. Dry arid and semiarid regions in the west, south, and central parts of the country contained the majority of high-risk areas. Predictions suggested that about 37% of India's areal extent and 380 million people were exposed to elevated nitrate. The prediction model performed satisfactorily over the validation dataset that indicates the prediction ability of the model at the local scale. The study aims to provide an effective approach for identifying elevated groundwater nitrate risk and aid in the development of awareness and strategies to uphold public health safety.
AB - Elevated groundwater nitrate poses risk to the ecosystem and human health, and delineating the extent of elevated groundwater nitrate risk is essential for effective groundwater management and public health safety. Here, using machine learning models (Random Forest, Boosted Regression Tree, and Logistic Regression) on a large, in situ dataset, we have predicted the first nationwide extent of groundwater nitrate contamination risk (concentration >45 mg/L) across India. We also aimed to delineate the intrinsic (e.g., climate, geomorphic, hydrogeologic) and extraneous (e.g., anthropogenic input) predictors for identifying groundwater pollution risk. Of these models, Random Forest performed best and was considered to develop the final prediction map of groundwater nitrate at 1 km2 resolution. Climate variables like precipitation and aridity, and anthropogenic influence, e.g., fertilizer application and population density, were identified as the most important variables for predicting groundwater nitrate risk. Dry arid and semiarid regions in the west, south, and central parts of the country contained the majority of high-risk areas. Predictions suggested that about 37% of India's areal extent and 380 million people were exposed to elevated nitrate. The prediction model performed satisfactorily over the validation dataset that indicates the prediction ability of the model at the local scale. The study aims to provide an effective approach for identifying elevated groundwater nitrate risk and aid in the development of awareness and strategies to uphold public health safety.
KW - Random Forest
KW - groundwater
KW - nitrate
KW - population exposure
KW - risk
UR - http://www.scopus.com/inward/record.url?scp=85137897329&partnerID=8YFLogxK
U2 - 10.1021/acsestengg.1c00360
DO - 10.1021/acsestengg.1c00360
M3 - Article
AN - SCOPUS:85137897329
SN - 2690-0645
VL - 2
SP - 689
EP - 702
JO - ACS ES and T Engineering
JF - ACS ES and T Engineering
IS - 4
ER -