TY - GEN
T1 - From Satellites to Fields
T2 - 2023 IEEE International Conference on Big Data, BigData 2023
AU - Nangia, Saniya
AU - Munasinghe, Thilanka
AU - Tubbs, Heidi
AU - Anyamba, Assaf
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In this work-in-progress study, we aim to determine the predictors of corn production in Iowa (United States), Heilongjiang (China), Mato Grosso (Brazil), Cordoba (Argentina), and Poltava Oblast (Ukraine). The effectiveness of predicting annual corn production using precipitation and land surface temperature (LST) values obtained from the Earth Engine tool, and Moderate Resolution Imaging Spectroradiometer (MODIS) Normalised Difference Vegetation Index (NDVI) values obtained from the National Aeronautics and Space Administration (NASA) Global Inventory Monitoring and Modelling Studies (GIMMS) Global Agricultural Monitoring System, is examined. A comparison is conducted between multiple linear regression, ridge regression, and lasso regression models. The highest adjusted R2 values are found in order to identify the optimal model in which the corn yield variance is explained by a combination of variables including the year, NDVI sample and anomaly values, precipitation, and LST. The results show that corn production values in Iowa, Heilongjiang, Mato Grosso, Cordoba, and Poltava Oblast can best be predicted using lasso regression models with adjusted R2 values of 0.841, 0.933, 0.847, 0.854, and 0.860 respectively.
AB - In this work-in-progress study, we aim to determine the predictors of corn production in Iowa (United States), Heilongjiang (China), Mato Grosso (Brazil), Cordoba (Argentina), and Poltava Oblast (Ukraine). The effectiveness of predicting annual corn production using precipitation and land surface temperature (LST) values obtained from the Earth Engine tool, and Moderate Resolution Imaging Spectroradiometer (MODIS) Normalised Difference Vegetation Index (NDVI) values obtained from the National Aeronautics and Space Administration (NASA) Global Inventory Monitoring and Modelling Studies (GIMMS) Global Agricultural Monitoring System, is examined. A comparison is conducted between multiple linear regression, ridge regression, and lasso regression models. The highest adjusted R2 values are found in order to identify the optimal model in which the corn yield variance is explained by a combination of variables including the year, NDVI sample and anomaly values, precipitation, and LST. The results show that corn production values in Iowa, Heilongjiang, Mato Grosso, Cordoba, and Poltava Oblast can best be predicted using lasso regression models with adjusted R2 values of 0.841, 0.933, 0.847, 0.854, and 0.860 respectively.
KW - Agriculture
KW - Earth Observational Data
KW - Machine Learning
KW - NDVI
KW - Yield estimation
UR - http://www.scopus.com/inward/record.url?scp=85184981212&partnerID=8YFLogxK
U2 - 10.1109/BigData59044.2023.10386225
DO - 10.1109/BigData59044.2023.10386225
M3 - Conference contribution
AN - SCOPUS:85184981212
T3 - Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023
SP - 3479
EP - 3488
BT - Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023
A2 - He, Jingrui
A2 - Palpanas, Themis
A2 - Hu, Xiaohua
A2 - Cuzzocrea, Alfredo
A2 - Dou, Dejing
A2 - Slezak, Dominik
A2 - Wang, Wei
A2 - Gruca, Aleksandra
A2 - Lin, Jerry Chun-Wei
A2 - Agrawal, Rakesh
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 December 2023 through 18 December 2023
ER -