PeakDecoder enables machine learning-based metabolite annotation and accurate profiling in multidimensional mass spectrometry measurements

Aivett Bilbao, Nathalie Munoz, Joonhoon Kim, Daniel J. Orton, Yuqian Gao, Kunal Poorey, Kyle R. Pomraning, Karl Weitz, Meagan Burnet, Carrie D. Nicora, Rosemarie Wilton, Shuang Deng, Ziyu Dai, Ethan Oksen, Aaron Gee, Rick A. Fasani, Anya Tsalenko, Deepti Tanjore, James Gardner, Richard D. SmithJoshua K. Michener, John M. Gladden, Erin S. Baker, Christopher J. Petzold, Young Mo Kim, Alex Apffel, Jon K. Magnuson, Kristin E. Burnum-Johnson

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Multidimensional measurements using state-of-the-art separations and mass spectrometry provide advantages in untargeted metabolomics analyses for studying biological and environmental bio-chemical processes. However, the lack of rapid analytical methods and robust algorithms for these heterogeneous data has limited its application. Here, we develop and evaluate a sensitive and high-throughput analytical and computational workflow to enable accurate metabolite profiling. Our workflow combines liquid chromatography, ion mobility spectrometry and data-independent acquisition mass spectrometry with PeakDecoder, a machine learning-based algorithm that learns to distinguish true co-elution and co-mobility from raw data and calculates metabolite identification error rates. We apply PeakDecoder for metabolite profiling of various engineered strains of Aspergillus pseudoterreus, Aspergillus niger, Pseudomonas putida and Rhodosporidium toruloides. Results, validated manually and against selected reaction monitoring and gas-chromatography platforms, show that 2683 features could be confidently annotated and quantified across 116 microbial sample runs using a library built from 64 standards.

Original languageEnglish
Article number2461
JournalNature Communications
Volume14
Issue number1
DOIs
StatePublished - Dec 2023

Funding

This work was part of the DOE Agile BioFoundry ( http://agilebiofoundry.org ) supported by the U.S. Department of Energy, Energy Efficiency and Renewable Energy, Bioenergy Technologies Office (to K.E.B.J.), and used capabilities developed under the National Institute of General Medical Sciences grant P41 GM103493 (to R.D.S.). Microbial biomass samples from fermentations on different hydrolysates were generated as a part of the Feedstock Conversion Interface Consortium, funded by the Department of Energy Bioenergy Technologies Office.

Fingerprint

Dive into the research topics of 'PeakDecoder enables machine learning-based metabolite annotation and accurate profiling in multidimensional mass spectrometry measurements'. Together they form a unique fingerprint.

Cite this