Integrating Climate Variable Data in Machine Learning Models for Predictive Analytics of Tomato Yields in California

Tianze Zhu, Junyi Wu, Tingyi Tan, Shuheng Wang, Thilanka Munasinghe, Heidi Tubbs, Assaf Anyamba

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditionally, agricultural forecasting has relied on empirical methods and basic statistical analysis, such as applying average values from previous years' yields or using a simple linear fit for next year's predictions. However, the emergence of data-driven approaches, particularly machine learning algorithms, has revolutionized yield prediction in agriculture. Machine learning techniques have demonstrated their potential to provide accurate predictions. However, existing models often rely on a limited number of input variables for crop yield predictions, which makes them only suitable for specific scenarios. In this study, we have developed four distinct machine learning-based predictors, incorporating various climate factors, including daytime temperature, nighttime temperature, precipitation (rainfall), vegetation index, and evapotranspiration as input variables to predict tomato acreage yields in counties of California, USA. Our results show that regression models constructed using neural networks and linear regression exhibited better performance than other predictors, achieving an average accuracy rate of 70% to 80%. Compared to most of the existing crop yield predictors, our models offer versatility while maintaining a desirable level of predictive accuracy. Expanding the number of input variables, such as nitrogen fertilizer usage etc, and introducing larger spatial and temporal high-resolution datasets for model training can improve our model performance, enabling us to obtain better results in tomato yield prediction.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Big Data, BigData 2023
EditorsJingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, Rakesh Agrawal
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3538-3546
Number of pages9
ISBN (Electronic)9798350324457
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Big Data, BigData 2023 - Sorrento, Italy
Duration: Dec 15 2023Dec 18 2023

Publication series

NameProceedings - 2023 IEEE International Conference on Big Data, BigData 2023

Conference

Conference2023 IEEE International Conference on Big Data, BigData 2023
Country/TerritoryItaly
CitySorrento
Period12/15/2312/18/23

Funding

We acknowledge USDA for publicly sharing the tomato production data and farmland area data. NDVI data was provided by the Global Inventory Modeling and Mapping Studies (GIMMS) Group Global Agricultural Monitoring (GLAM) System - https://glam1.gsfc.nasa.gov/ funded through an interagency agreement grant between USDA/FAS/OGA and NASA/GSFC. Other NASA data portals obtained different data sets, including temperature, precipitation, and evapotranspiration data. Assaf Anyamba and Heidi Tubbs s participation in this research was enabled through Group on Earth Observations (GEO) Health Community of Practice (CoP) activities for Student engagement under the direction of Thilanka Munasinghe and Dr.Assaf Anyamba (http://www.geohealthcop.org/). We thank Mrs. Elizabeth Joyer from NASA for her guidance in locating datasets during the beginning of the study. We acknowledge USDA for publicly sharing the tomato production data and farmland area data. NDVI data was provided by the Global Inventory Modeling and Mapping Studies (GIMMS) Group Global Agricultural Monitoring (GLAM) System - https://glam1.gsfc.nasa.gov/ funded through an interagency agreement grant between USDA/FAS/OGA and NASA/GSFC. Other NASA data portals obtained different data sets, including temperature, precipitation, and evapotranspiration data. Assaf Anyamba and Heidi Tubbs’s participation in this research was enabled through Group on Earth Observations (GEO) Health Community of Practice (CoP) activities for Student engagement under the direction of Thilanka Munas-inghe and Dr.Assaf Anyamba (http://www.geohealthcop.org/). We thank Mrs. Elizabeth Joyer from NASA for her guidance in locating datasets during the beginning of the study.

FundersFunder number
OGA
National Aeronautics and Space Administration
U.S. Department of Agriculture
College of Pharmacy
Florida Academy of Sciences
Georgia Student Finance Commission

    Keywords

    • Climate Variables
    • Evapotranspiration
    • Machine Learning
    • NDVI
    • Neural Network

    Fingerprint

    Dive into the research topics of 'Integrating Climate Variable Data in Machine Learning Models for Predictive Analytics of Tomato Yields in California'. Together they form a unique fingerprint.

    Cite this