Few-Shot Learning Enables Population-Scale Analysis of Leaf Traits in Populus trichocarpa

John Lagergren, Mirko Pavicic, Hari B. Chhetri, Larry M. York, Doug Hyatt, David Kainer, Erica M. Rutter, Kevin Flores, Jack Bailey-Bale, Marie Klein, Gail Taylor, Daniel Jacobson, Jared Streich

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Plant phenotyping is typically a time-consuming and expensive endeavor, requiring large groups of researchers to meticulously measure biologically relevant plant traits, and is the main bottleneck in understanding plant adaptation and the genetic architecture underlying complex traits at population scale. In this work, we address these challenges by leveraging few-shot learning with convolutional neural networks to segment the leaf body and visible venation of 2, 906 Populus trichocarpa leaf images obtained in the field. In contrast to previous methods, our approach (a) does not require experimental or image preprocessing, (b) uses the raw RGB images at full resolution, and (c) requires very few samples for training (e.g., just 8 images for vein segmentation). Traits relating to leaf morphology and vein topology are extracted from the resulting segmentations using traditional open-source image-processing tools, validated using real-world physical measurements, and used to conduct a genome-wide association study to identify genes controlling the traits. In this way, the current work is designed to provide the plant phenotyping community with (a) methods for fast and accurate image-based feature extraction that require minimal training data and (b) a new population-scale dataset, including 68 different leaf phenotypes, for domain scientists and machine learning researchers. All of the few-shot learning code, data, and results are made publicly available.

Original languageEnglish
Article number0072
JournalPlant Phenomics
Volume5
DOIs
StatePublished - 2023

Funding

The authors would like to acknowledge members of the Taylor Lab (University of California, Davis): Z. (Janna) Meng and A. Zhu, for their support during data collection. Funding: This research used resources of the Oak Ridge Leadership Computing Facility, which is a Department of Energy (DOE) Office of Science User Facility supported under Contract DE-AC05-00OR22725. This work was funded by the Artificial Intelligence (AI) Initiative, an ORNL Laboratory Directed Research and Development program, and by the Center for Bioenergy Innovation (CBI), which is a U.S. DOE Bioenergy Research Center supported by the Office of Biological and Environmental Research in the DOE Office of Science. The sequencing work was conducted by the U.S. DOE Joint Genome Institute, a DOE Office of Science User Facility, and is supported by the Office of Science of the U.S. DOE operated under Contract No. DE-AC02-05CH11231. The manuscript was authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. DOE. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for U.S. Government purposes. The DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). Author contributions: J.L.: Conceptualization, funding acquisition, data collection, segmentation, baseline comparison, feature extraction, and writing. M.P.: Conceptualization, data collection, feature extraction, and writing, H.B.C.: Conceptualization, data collection, genomic analysis, and writing. L.M.Y.: Conceptualization, feature extraction, and writing. D.H.: Genomic analysis and writing. D.K.: Genomic analysis and writing. E.M.R.: Segmentation and writing. K.F.: Segmentation and writing. J.B.-B.: Field site support and writing. M.K.: Field site support and writing. G.T.: Field site support and writing. D.J.: Conceptualization, funding acquisition, supervision, and writing. J.S.: Conceptualization, funding acquisition, data collection, and writing. Competing interests: The authors declare that they have no competing interests.

FundersFunder number
Artificial Intelligence
DOE Public Access Plan
ORNL Laboratory Research and Development Program
U.S. Government
U.S. Department of Energy
Office of ScienceDE-AC05-00OR22725
Biological and Environmental Research
Center for Bioenergy Innovation
Joint Genome InstituteDE-AC02-05CH11231

    Fingerprint

    Dive into the research topics of 'Few-Shot Learning Enables Population-Scale Analysis of Leaf Traits in Populus trichocarpa'. Together they form a unique fingerprint.

    Cite this