Abstract
Background: Root system architecture (RSA) is of growing interest in implementing plant improvements with belowground root traits. Modern computing technology applied to images offers new pathways forward to plant trait improvements and selection through RSA analysis (using images to discern/classify root types and traits). However, a major stumbling block to image-based RSA phenotyping is image label noise, which reduces the accuracies of models that take images as direct inputs. To address the label noise problem, this study utilized an artificial intelligence model capable of classifying the RSA of alfalfa (Medicago sativa L.) directly from images and coupled it with downstream label improvement methods. Images were compared with different model outputs with manual root classifications, and confident machine learning (CL) and reactive machine learning (RL) methods were tested to minimize the effects of subjective labeling to improve labeling and prediction accuracies. Results: The CL algorithm modestly improved the Random Forest model’s overall prediction accuracy of the Minnesota dataset (1%) while larger gains in accuracy were observed with the ResNet-18 model results. The ResNet-18 cross-population prediction accuracy was improved (~8% to 13%) with CL compared to the original/preprocessed datasets. Training and testing data combinations with the highest accuracies (86%) resulted from the CL- and/or RL-corrected datasets for predicting taproot RSAs. Similarly, the highest accuracies achieved for the intermediate RSA class resulted from corrected data combinations. The highest overall accuracy (~75%) using the ResNet-18 model involved CL on a pooled dataset containing images from both sample locations. Conclusions: ResNet-18 DNN prediction accuracies of alfalfa RSA image labels are increased when CL and RL are employed. By increasing the dataset to reduce overfitting while concurrently finding and correcting image label errors, it is demonstrated here that accuracy increases by as much as ~11% to 13% can be achieved with semi-automated, computer-assisted preprocessing and data cleaning (CL/RL).
Original language | English |
---|---|
Article number | 0251 |
Journal | Plant Phenomics |
Volume | 6 |
DOIs | |
State | Published - Jan 2024 |
Funding
The authors express their sincere thanks to the editor and reviewers for their valuable suggestions and comments. Funding:ThisstudywasfundedbytheU.S.DepartmentofAgriculture (Agricultural Research Service project 5062-12210-004-D and 2020-67021-32460). L.M.Y. was funded by the Center for Bioenergy Innovation (CBI), U.S. Department of Energy, Office of Science, Biological and Environmental Research Program under Award Number ERKP886. This manuscript has been authored in part by UT-Battelle, LLC that manages Oak Ridge National Laboratory under contract DE-AC05-00OR22725 with the U.S. Department of Energy (DOE). The publisher acknowledges the U.S. government license to provide public access under the DOE Public Access Plan (https://energy.gov/downloads/doe-public-access-plan). The authors express their sincere thanks to the editor and reviewers for their valuable suggestions and comments. Funding:ThisstudywasfundedbytheU.S.DepartmentofAgriculture (Agricultural Research Service project 5062-12210-004-D and 2020-67021-32460). L.M.Y. was funded by the Center for Bioenergy Innovation (CBI), U.S. Department of Energy, Office of Science, Biological and Environmental Research Program under Award Number ERKP886. This manuscript has been authored in part by UT-Battelle, LLC that manages Oak Ridge National Laboratory under contract DE-AC05-00OR22725 with the U.S. Department of Energy (DOE). The publisher acknowledges the U.S. government license to provide public access under the DOE Public Access Plan (https://energy.gov/ downloads/doe-public-access-plan). Author contributions: B.J.W.: Data analysis, writing original draft text and graphics, review and editing, and visualization. Z. Tang: Data analysis, writing original draft text and graphics, review and editing, and visualization. Z. Tian: Review and editing. D.J.H.: Writing original draft text, review and editing, and funding acquisition. T.H.T.: Review and editing. A.S.: Review and editing. Zhou Zhang: Review and editing. L.M.Y.: Data acquisition, review and editing, and funding acquisition.
Funders | Funder number |
---|---|
DOE Public Access Plan | |
Center for Bioenergy Innovation | |
U.S. Department of Energy | |
Office of Science | |
Biological and Environmental Research program | ERKP886 |
Oak Ridge National Laboratory | DE-AC05-00OR22725 |